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Description 
Background 

s The present invention relates generally to the manipulation ot genetic materials and. more particularly, to recom- 

binant procedures making possible the production of polypeptides possessing part or all of the primary structural con- 
formation. 

A. Manipulation Of Genetic Materials 

10 

Genetic nriaterials may be broadly defined as those chemical substances which program for and guide the manu- 
facture of constituents of cells and viruses and direct the responses of cells and viruses. A long chain polymeric sub- 
stance known as deoxyribonucleic acid (DNA) comprises the genetk; material of all living cells and viruses except for 
certain viruses which are programmed by ribonuciek; acids (RNA). The repeating units in DNA polymers are four 
IS different nucleotides, each of which consists of either a purine (adenine or guanine) or a pyrimkiine (thymine or cytosine) 
bound to a deoxyribose sugar to which a phosphate group is attached. Attachment of nucleotides in linear polymeric 
form is by means of fusion of the 5' phosphate of one nucleotide to the 3' hydroxyl group of another Functbnal DNA 
occurs in the form of stable double stranded associatk>ns of single strands of nucleotides (known as deoxyoligonucle- 
otides), whk^h associations occur by means of hydrogen bonding between purine and pyrimidlne bases [i.e., 'comple- 
te mentary" associations existing between adenine (A) and thymine (T) or guanine (G) and cytosine (C)]. By convention, 
nucleotides are referred to by the names of their constituent purine or pyrimidine bases, and the complementary as- 
sociations of nucleotides in double stranded DNA (i.e., A — T and G — C) are referred to as "base pairs". Ribonucleic 
acid is a polynucleotide comprising adenine, guanine, cytosine and uracil (U), rather than thymine, bound to ribose 
and a phosphate group. 

2S Most briefly put, the programming function of DNA Is generally effected through a process wherein specific DNA 

nucleotide sequences (genes) are transcribed" into relatively unstable messenger RNA (mRNA) polymers. The mRNA, 
in turn, serves as a template for the formation of structural, regulatory and catalytic proteins from amino acids. This 
mRNA "translation" process involves the operations of small RNA strands (tRNA) which transport and align individual 
amino acids along the mRNA strand to allow for formation of polypeptides in proper amino acW sequences. The mRNA 

30 "message", derived from DNA and providing the basis for the tRNA supply and orientation of any given one of the 
twenty amino acids for polypeptide "expression", is in the form of triplet "codons" — sequential groupings of three 
nucleotkie bases. In one sense, the formation of a protein is the ultimate form of "expression" of the programmed 
genetic message provided by the nucleotide sequence of a gene. 

■ProrTK>ter* DNA sequences usually "precede" a gene in a DNA polymer and provide a site for initiation of the 

55 transcription into mRNA. "Regulator" DNA sequences, also usually "upstream" of (i.e., preceding) a gene in a given 
DNA polymer, bind proteins that determine the frequency (or rate) ot transcriptional initiation. Collectively referred to 
as "promoter/regulator" or "control" DNA sequence, these sequences which precede a selected gene (or series of 
genes) in a functional DNA polymer cooperate to determine whether the transcription (and eventual expression) of a 
gene will occur DNA sequences which "follow* a gene in a DNA polymer and provkle a signal for termination of the 

40 transcription into mRNA are referred to as transcription "terminator" sequences. 

A focus of microbiological processing for the last decade has been the attempt to manufacture industrially and 
pharmaceutical ty significant substances using organisms which either do not initially have genetically coded information 
concerning the desired product included in their DNA, or (in the case of mammalian ceils in culture) do not ordinarily 
express a chromosomal gene at appreciable levels. Simply put, a gene that specifies the structure of a desired polypep- 

45 tide product Is either isolated from a "donor" organism or chemically synthesized and then stably introduced into another 
organism which is preferably a self-replicating unicellular organism such as bacteria, yeast or mammalian cells in 
culture. Once this is done, the existing machinery for gene expression in the "transfomned" or "transfected" microbial 
host cells operates to construct the desired product, using the exogenous DNA as a template for transcription of mRNA 
which is then translated into a continuous sequence of amino acid residues. 

so The art is rich in patent and literature publk:ations relating to "recombinant DNA" methodologies for the isolation, 

synthesis, purification and amplification of genetic materials for use in the transformatton of selected host organisms. 
U.S. Letters Patent No. 4,237,224 to Cohen, at al., for example, relates to transformation of unicellular host organisms 
with "hybrki* viral or circular plasmid DNA which includes selected exogenous DNA sequences. The procedures of the 
Cohen, at al. patent first involve manufacture of a transformation vector by enzymatically cleaving viral or circular 

55 plasmid DNA to form linear DNA strands. Selected foreign ("exogenous" or "heterologous") DNA strands usually in- 
cluding sequences coding for desired product are prepared in linear form through use of a similar enzymes. The linear 
viral or plasmid DNA is incut>ated with the foreign DNA In the presence of ligating enzymes capable of effecting a 
restoration process and "hybrid" vectors are formed which include the selected exogenous DNA segment "spliced" 
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into the viral or circular DNA plasmid. 

Transformation of compatible unicellular host organisms with the hybrid vector results in the formation of multiple 
copies of the exogenous DNA In the host cell population. In some Instances, the desired result is simply the amplification 
of the foreign DNA and the "product" han^ested is DNA. More frequently, the goal of transformation is the expression 
by the host cells of the exogenous DNA in the form of large scale synthesis of isolatable quantities of commercial ty 
significant protein or polypeptide fragments coded for by the foreign DNA. See also, e.g., U.S. Letters Patent Nos. 
4,264.731 (to Shine), 4,273.875 (to Manis). 4,293,652 (to Cohen), and European Patent Application 093,61 9, published 
November 9, 1983. 

The development of specific DNA sequences for splicing into DNA vectors is accomplished by a variety of tech- 
niques, depending to a great deal on the degree of "foreignness" of the "donor" to the projected host and the size of 
the polypeptide to be expressed in the host. At the risk of over-simplification, it can be stated that three alternative 
principal methods can be employed: (1) the "isolation" of double-stranded DNA sequence from the genomic DNA of 
the donor; (2) the chemical manufacture of a DNA sequence providing a code for a polypeptide of interest; and (3) the 
in vitro synthesis of a double-stranded DNA sequence by enzymatic "reverse transcription" of mRNA isolated from 
donor cells. The last-mentioned methods which involve formation of a DNA "complement" of mRNA are generally 
referred to as "cDNA" methods. 

Manufacture of DNA sequences is frequently the method of choice when the entire sequence of amino acid residues 
of the desired polypeptide product is known. DNA manufacturing procedures of coowned, co-pending U.S. Patent 
Application Serial No. 483,451, by Alton, at al., (filed April 15, 1983 and corresponding to PCT US83/00605, published 
November 25, 1 983 as W083/04053), for example, provide a superior means for accomplishing such highly desirable 
results as: providing for the presence of alternate codons commonly found in genes which are highly expressed in the 
host organism selected for expression (e.g., providing yest or E. co// "preference" codons); avoiding the presence of 
untranslated "Intron" sequences (commonly present in mammalian genomic DNA sequences and mRNA transcripts 
thereof) which are not readily processed by procaryotic host cells; avoiding expression of undesired "leader" polypep- 
tide sequences commonly coded for by genomic DNA and cDNA sequences but frequently not readily cleaved from 
the polypeptide of interest by bacterial or yeast host cells; providing for ready insertion of the DNA in convenient 
expression vectors in associatton with desired promotor/regulator and terminator sequences; and providing for ready 
construction of genes for polypeptide fragments and analogs of the desired polypeptides. 

When the entire sequence of amino acid residues of the desired polypeptide is not known, direct manufacture of 
DNA sequences is not possible and isolation of DNA sequences coding for the polypeptide by a cDNA method becomes 
the method of choice despite the potential drawbacks in ease of assembly of expression vectors capable of providing 
high levels of microbial expression referred to above. Among the standard procedures for isolating cDNA sequences 
of interest Is the preparation of plasmid-borne cDNA "libraries" derived from reverse transcription of mRNA abundant 
in donor cells selected as responsible for high level expression of genes (e.g., libraries of cDNA derived from pituitary 
celts which express relatively large quantities of growth hormone products). Where substantial portions of the polypep- 
tide's amino acid sequence are known, labelled, single-stranded DNA probe sequences duplicating a sequence puta- 
tively present in the "target" cDNA may be employed in DNA/DNA hybridization procedures carried out on cloned 
copies of the cDNA which have been denatured to single stranded form. [See. generally, the disclosure and discussions 
of the art provided in U.S. Patent No. 4,394,443 to Welssman, at al. and the recent demonstrations of the use of long 
oligonucleotide hybridization probes reported in Wallace, at al., Nuc. Acids Res,, 6, pp. 3543 — 3557 (1 979), and Reyes, 
at al., F/V.AS. (U.S.A.), 79. pp. 3270—3274 (1982). and Jaye. at al., Nuc. Acids Res., 11, pp. 2325—2335 (1983). 
See also, U.S. Patent No. 4,358,535 to Falkow. at al., relating to DNA/DNA hybridization procedures in effecting diag- 
nosis; published European Patent Application Nos. 0070685 and 0070687 relating to light-emitting labels on single 
stranded polynucleotide probes; Davis, at al., "A Manual for Genetic Engineering. Advanced Bacterial Genetics". Cold 
Spring Harbor Laboratory, Cold Spring Harbor, N. Y. (1 980) at pp. 55 — 58 and 1 74 — 1 76. relating to colony and plaque 
hybridization techniques; and. New England Nuclear (Boston. Mass.) brochures for "Gene Screen" Hybridization Trans- 
fer Membrane materials providing instruction manuals for the transfer and hybridization of DNA and RNA, Catalog No. 
NEF-972.] 

Among the more significant recent advances in hybridization procedures for the screening of recombinant clones 
is the use of labelled mixed synthetic oligonucleotide probes, each of which is potentially the complete complement of 
a specific DNA sequence in the hybridization sample including a heterogenous mixture of single stranded DNAs and 
RNAs. These procedures are acknowledged to be especially useful in the detection of cDNA clones derived from 
sources which provide extremely low amounts of mRNA sequences for the polypeptide of interest. Briefly put, use of 
stringent hybridization conditions directed toward avokJance of non-specific binding can allow, e.g., for the autoradio- 
graphic visualizatbn of a specific cDNA clone upon the event of hybridization of the target DNA to that single probe 
within the mixture which is its complete complement. See generally, Wallace, at aL, Nuc. Acids Res., 9, pp. 879 — 897 
(1981); Suggs, at aL RN.AS. (U.S.A), 78, pp. 6613—6617 (1981); Choo, at al., Nature, 299, pp. 178—180 (1982); 
KurachI, at al.. PMAS. (U.S. A), 79, pp. 6461—6464 (1982); Ohkubo. at al., RNAS. (U.S.A.), 80, pp. 2196—2200 
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(1983); and Kornbllhtt. at a!.. RN.AS. (U.S.A.), 80. pp. 3218-3222 (1983). In general, the mixed probe procedures of 
Wallace, at al. (1981), supra, have been expanded upon by various workers to the point where reliable results have 
reportedly been obtained In a cDNA clone Isolation using a 32 member mixed "pool" of 16-base-long (16-mer) oligo- 
nucleotide probes of uniformly, varying DNA sequences together with a single 11-mer to effect a two-site "positive" 
5 confirmation of the presence of cDNA of interest. See. Singer-Sam. at al. , P.N. A. S. (U.S. A), 80. pp. 802 — 806 (1 983). 
The use of genomic DNA isolates is the least common of the three above-noted methods for developing specific 
DNA sequences for use in recombinant procedures. This is especially true in the area of recombinant procedures 
directed to securing microbial expression of mammalian polypeptides and is due, principally to the complexity of mam- 
malian genomic DNA. Thus, while reliable procedures exist for developing phage-borne libraries of genomic DNA of 

10 human and other mammalian species origins [See, e.g., Lawn, at al. Cell, 15, pp. 1157—1174 (1978) relating to pro- 
cedures for generating a human genomic library commonly referred to as the "Maniatis Library"; Karn. at al., RN.A.S. 
(U S.A), 77, pp 51 72 — 51 76 (1 980) relating to a human genomic library based on alternative restriction endonuclease 
fragmentation procedure; and Blattner. at al., Science, 196, pp. 161 — 169 (1977) describing construction of a bovine 
genomic library) there have been relatively few successful attempts at use of hybridization procedures in isolating 

IS genomic DNA in the absence of extensive foreknowledge of amino acid or DNA sequences. As one example, Fiddes, 
at al.. etal., J. Mol, andApp. Genetics, 1, pp. 3^18 (1981) report the successful isolation of a gene coding for the 
alpha subunit of the human pituitary glycoprotein hormones from the Maniatis Library through use of a "full length" 
probe including a complete 621 base pair fragment of a previously-isolated cDNA sequence for the alpha subunit. As 
another example, Das, at al., RN.AS. (U.S.A.), 80, pp. 1531-1535 (1983) report isolation of human genomrc clones 

20 for human HLA— DR using a 175 base pair synthetic oligonucleotide. Finally, Anderson, at al., RN.AS. (U.S.A.), 80, 
pp. 6838 — 6842 (1983) report the isolation of genomic ctone.for bovine pancreatic trypsin inhibitor (BPTI) using a 
single probe 86 base pairs in length and constructed according to the known amino ackJ sequence of BPTI. The authors 
note a determination of poor prospects for isolating mRNA suitable for synethesis of a cDNA library due to apparent 
low levels of mRNA in initially targeted carotid gland and lung tissue sources and then address the prospects of success 

2S in probing a genomic library using a mixture of labelled probes, stating: "More generally, mixed-sequence oligodeox- 
ynucleotide probes have been used to isolate protein genes of unknown sequence from cDNA libraries. Such probes 
are typically mixtures of 8 — 32 oligonucleotides, 14 — 17 nucleotides in length, representing every possible codon 
combination for a small stretch (5 — 6 residues) of amino acid sequence. Under stringent hybridization conditions that 
discriminate against Incorrectly base-paired probes, these mixtures are capable of locating specific gene sequences 

30 in clone libraries of low-to-moderate complexity. Nevertheless, because of their short length and heterogeneity, mixed 
probes often lack the specificity required for probing sequences as complex as a mammalian genome. This makes 
such a method impractical for the isolation of mammalian protein genes when the corresponding mRN As are unavail- 
able." (Citattons omitted). 

There thus continues to exist a need in the art for improved methods for effecting the rapid and efficient Isolation 
3S of cDNA clones in instances where little Is known of the amino acid sequence of the polypeptide coded for and where 
"enriched" tissue sources of mRNA are not readily available for use In constructing cDNA libraries. Such improved 
methods would be especially useful if they were applk;able to isolating mammalian genomic clones where sparse 
information is available concerning amino acid sequences of the polypeptide coded for by the gene sought. 

40 B. Erythropoietin As A Polypeptide Of Interest 

Erythropoiesis, the production of red blood cells, occurs continuously throughout the human life span to offset cell 
destruction. Erythropoiesis is a very precisely controlled physiological mechanism enabling sufficient numbers of red 
blood cells to be available in the blood for proper tissue oxygenation, but not so many that the cells woukJ impede 
45 circulation. The formatbn of red blood cells occurs in the bone marrow and is under the control of the hormone, eryth- 
ropoietin. 

Erythropoietin, an acidic glycoprotein of approximately 34.000 dalton molcular weight, may occur in three forms: 
a, p and asialo. The a and p forms differ slightly in carbohydrate components, but have the same potency, biological 
activity and molecular weight. The aslato form is an a and p form with the terminal carbohydrate (sialic acid) removed. 

50 Erythropoietin is present in very low concentrations in plasma when the body is in a healthy state wherein tissues 
receive sufffcient oxygenalton from the existing number of erythrocytes. This normal low concentration is enough to 
stimulate replacement of red bk>od cells which are tost normally through aging. 

The amount of erythropoietin in the circulation is increased under conditions of hypoxia when oxygen transport by 
blood cells in the circulation is reduced. Hypoxia may be caused by loss of large amounts of blood through hemorrhage. 

55 destruction of red blood cells by over-exposure to radiation, reduction in oxygen intake due to high altitudes or prolonged 
unconsciousness, or various forms of anemia. In response to tissues undergoing hypoxic stress, erythropoietin will 
increase red blood cell production by stimulating the converston of primitive precursor cells in the bone marrow into 
proerythroblasts which subsequently mature, synthesize hehiogtobin and are released into the circulation as red blood 
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cells. When the number of red blood cells in circulation is greater than needed for normal tissue oxygen requirements, 
erythropoietin in circulation is decreased. 

See generalty, Testa, at aL, Exp. HematoL, 8(Supp: 8), 144—152 (1980); Tong, at al., J. Biol. Chem., 256(24), 
1266—12672 (1981); Goldwasser. J. Cell. Physiol.. 110(Supp. 1), 133—135 (1982); Finch. Blood, 60(6), 1241—1246 

(1982) ; Sytowski, at a!., ExpL HematoL, 8(Supp 8), 52—64 (1980); Naughton. Ann, Clin. Lab. Sci., 13(5), 432—438 

(1983) ; Weiss, at al., Am. J. Vet. Res., 44(10), 1832—1835 (1983); Uppin. at al., Exp HematoL, 11(7), 661—666 
(1983); Baciu, at a!.. Ann N. Y. Acad. Set., 414, 66—72 (1983); Murphy, at aL. Acta. Haeimttologica Japonica, 46(7), 
1380—1396 (1983); Dessypris. at al.. Brit. J. Haematol, 56. 295—306 (1984); and. Emmanouel, atal., Am. J. Physiol. 
247 (1 Pt 2), F1 68—76 (1 984). 

Because erythropoietin is essential in the process of red blood cell formation, the hormone has potential useful 
application in both the diagnosis and the treatment of blood disorders characterized by low or defective red blood cell 
production. See, generally, Pennathur-Das. at al., Blood, 63(5), 1168—71 (1984) and Haddy, Am. Jour Ped. HematoL/ 
Oncol., 4, 191 — 196, (1982) relating to erythropoietin in possible therapies for sickle cell disease, and Eschbach, at 
al. J. Clin. Invest., 74(2), pp. 434 — 441 , (1 984), describing a therapeutic regimen for uremia sheep based on in viva 
response to erythropoietin-rich plasma Infusions and proposing a dosage of 10 U EPO/kg per day for 15 — 40 days as 
corrective of anemia of the type associated with chronic renal failure. See also. Krane. Henry Ford Hops. Med. J., 31 
(3),^77—^B^ (1983). 

It has recently been estimated that the availability of erythropoietin in quantity would allow for treatment each year 
of anemias of 1 ,600,000 persons in the United States alone. See. e.g.. Morrison, "Bbprocessing in Space — an Over- 
view", pp. 557—571 in The World Biotech Report 1984. Volume 2: USA, (Online Publicatbns, New York, N.Y, 1984). 
Recent studies have provided a basis for projection of efficacy of erythropoietin therapy in a variety of disease states, 
disorders and state of hematologic Irregularity; Vedovato, at aL, Acta. Haematol, 71, 21 1 — 21 3 (1 984) (beta-thalassem- 
ia); VIchinsky, at al., J. Pediatr. 105(1), 15—21 (1984) (cystic fibrosis); Cotes, at al.. Brit. J. Obstet. GyneacoL 90(4), 
304 — 311 (1983) (pregnancy, menstrual disorders); Haga, et aL, Acta. Pediatr. Scand., 72, 827 — 831 (1983) (early 
anemia of prematurity); Claus-Walker, at al., Arch. Phys. Med. RehabiL, 65, 370 — 374 (1984) (spinal cord injury); 
Dunn. atal.. Eur. J. AppL Physiol., 52, 178—182 (1984), (space flight); Miller, at al.. Brit J. HaematoL, 52., 545—590 
(1982) (acute blood toss); Udupa, at al.. J. Lab. Clin. Med., 103(4), 574—580 and 581—588 (1984); and Lipschitz. at 
al.. Blood, 63(3), 502—509 (1983) (aging); and Dainiak. at al., Cancer, 51(6), 1101—1106 (1983) and Schwartz, al 
al„ Otolaryngol, 109, 269—272 (1 983) (various neoplastic disease states accompanied by abnormal erythropoiesis). 

Prior attempts to obtain erythropoietin in good yield from plasma or urine have proven relatively unsuccessful. 
Complicated and sophisticated laboratory techniques are necessary and generally result in the collection of very small 
amounts of impure and unstable extracts containing erythropoietin. 

U.S. Letters Patent No. 3,033,753 describes a method for partially purifying erythropoietin from sheep blood plasma 
whrch provides tow yields of a crude solid extract containing erythropoietin. 

Initial attempts to isolate erythropoietin from urine yielded unstable, biologically inactive preparations of the hor- 
mone. U.S. Letters Patent No. 3,865,801 describes a method of stabilizing the biological activity of a crude substance 
containing erythropoietin recovered from urine. The resulting cojde preparation containing erythropoietin purportedly 
retains 90% of erythropoietin activity, and is stable. 

Another method of purifying human erythropoietin f rbm urine of patients with aplastic anemia Is described in Miyake, 
at al.. J. BioL Chem., Vol. 252. No. 15 (August 10, 1977). pp. 5558—5564. This seven-step procedure includes ion 
exchange chromatography, ethanol precipitation, gel filtration, and adsorption chromatography, and yields a pure eryth- 
ropoietin preparation with a potency of 70,400 units/mg of protein in 21% yield. 

U.S. Letters Patent No. 4.397,840 to Takezawa, at al.. describes methods of preparing "an erythropoietin product" 
from healthy human urine specimens with weakly basic ion exchangers and proposes that the low molecular weight 
products obtained "have no inhibitory effects against erythropoietin. 

U.K. Patent Application No. 2,085.887 by Suglmoto. at al.. published May 6, 1982. describes a process for the 
production of hybrid human lymphoblastoid cells, reporting production levels ranging from 3 to 420 Units of erythro- 
poietin per ml of suspension of cells (distributed into the cultures after mammalian host propagation containing up to 
10^ cells per ml. At the highest production levels asserted to have been obtained, the rate of erythropoietin production 
could be calculated to be 40 to about 4,000 Units/10^ cells/48 hours in in vitro culture following transfer of cells from 
in Vfvo propagation systems. (See also the equivalent U.S. Letters Patent No. 4.377,513.) Numerous proposals have 
been made for isolation of erythropoietin from tissue sources, including neoplastic cells, but the yields have been quite 
low. See. e.g., Jelknr>an. et al.. Expt. HematoL, 11(7), 581—588 (1983); Tambourin, at al., RN.A.S. (U.S.A.), 80, 
6269—6273 (1983); Katsuoka. at al.. Gann. 74, 534—541 (1983); Hagiwara. at al., Blood, 63(4), 828—835 (1984); 
and Choppin. at al., Blood, 64(2), 341—347 (1984). 

Other isolation techniques utilized to obtain purified erythropoietin involve immunological procedures. A polyclonal, 
serum-derived antibody directed against erythropoietin is developed by injecting an animal, preferably a rat or rabbit, 
with human erythropoietin. The injected human erythropoietin Is recognized as a foreign antigento substance by the 
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cjuy— .jio (1981), Ross, at al., Nature, 294, 654—656 (1981)* Walter at al PA/4 <? /// a a x 7v chot l^o,^r^ 
(1980); Lemer. at al.. PMAS. (U.S.A.). 78. 3403-^07 (!981);^'vlStlTaTa1 '^/^tf ["|^f 75 

1O0 ^W{y^B2),an6Lemex, Scientific American, 248, Uo 2 66— 74n983^ See akn i^aicor ^toi o^;^ ooo 
pp.249-255(1984)re.atingtobk,logicalandimm;.nologica.acStiesosS^^^^ 

?eZ':frrra:tS'""^^^^ 

IrSS «.^H . sequences of proteins other than erythropoietin, a substance for which no substantial 

r;24" nrprrrSa'^bv'? fi'^'T:'-:: '^^-^^"^'"^ ^ ^ •^^^-^ AppiiL^;rseSi.' 

44fi thiri fL ' ^ 9"e. published August 22, 1984 as European Patent Application No 0 116 

^ScLZ^M I ' '"""^^-^''"^^ ^y^^i*^- 'ine (A T.C.C. No. HB8209) which producSThTiJ speciic 

f"i,rererrr^^^^^^ 

following sequence of amino acids: 



NH,Ato.P™.p,,^,^U~.»..C,.^,..S„..^.V.ti.,..0.,..*^TW^,.^^„GI,«.i»l,^COOH. 

3651— 365S nqflqwailj^^n r^^ , ^ ' ®* ^' ■ '^a'- -^ca"- Sc*^ C^^SX;. 80. pp 
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Since the filing of U.S. Patent Application Serial Nos 561 024 and 582 i f« tho,o h=o ^ ■ 

25 the cloning and expression of what Is asserted to have h^Bnh ."^V ^^'^ ^as appeared a single report of 

Of cDNA Clones were insertedTnto £ cSSr^c^I^d ft ^^^^A in E. coli Briefly put, a number 

wrth a monoclonal antibody to unsl^eciirSit^S^^^^^^ '° ''^ immunoreactlve 

SCA (USA). 81. pp. 27(^712^1^ erythropoietin. See. Lee-Huang. Proc. Nat. Acad. 

.^^'^T^Ti^Z"::::^^^ '"'^ ---- -^^ hu„.n 

essuT^S^reTn'^SSexp^^^^^^ 

the goal. As an exaii pTS^lrS T^rL^mato, » erythropoietin have fallen far short of 

from kidney tissues of ^^^.S^^^^^^^^^J^, ' °' ''''''' ^'^^^^•'^^ °' '""^A 

wfth the rather transito,? res^ttT ^f^o Su^^^^^^^ "^"''^ ^^""^"^ ^^"^ «^y««« 

dispbyingbtologK^al properties Of er;;^pS^^;^:^^^^^^^ 

enyrthropoietin per micr^rar^ Sted^^^^^^ ^ f T °' ^ ^^^'"9 ^^^^i-V °' 

for erythropoietin were SwtlaSto bo cot ? f °* °' "'"NA cod ng 

transbtb?i„tothescLgS.t?? SScI^ ,^s^^^^^^^ T *°.k'1''°^ '^^^'^ °' '""^A 

exoression. allowing foMhe consUuc L rn en^hed human k^^^^^ '""l^- ^ °* erythropoietin 

be isolated. [See also, Farber. Cin. Res Sm'!!^^^!^^^^^ 

"''"9°' U S. Patent Application Serial Nos 561 024 and 582 1 f« thorah=o ^ • 

« the cloning and expression of what is asserted to h^lTZ^Tu and 582,1 85. there has appeared a single report of 

Of CDNA cLes were InsTrtitto ? S/Z,Sd^^^^ 

With a monoclonal antibocy to an mt^eM t^S'^H "l^ ^"^"""^ ''^'^ "^"^ '° immunoreactlve 
Scl. (USA). 81. pp. 27^^12 (1 9W) erythropoietin. See, Lee+luang, Proc. Nat. Acad. 
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*o Brief Summary 

The present Invention provldes:- 
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(a) the DNA sequences set out In Tables V and VI or their complementary strands; 

(b) DNA sequences which hybridize under stringent conditions to the protein coding regions of the DNA se- 
quences defined in (a) or fragments thereof; and 

(c) DNA sequences which, but for the degeneracy of the genetic code, would hybridize to the DNA sequences 
defined in (a) and (b). 

2. A DNA sequence according to point 1 encoding human erythropoietin. 

a A cDNA sequence according to point 1 being a monkey species erythropoietin coding DNA sequence. 

4. A DNA sequence according to point 3 and Including the protein coding region set forth In Table V. 

5. A genomic DNA sequence according to point 1 or 2. 

6. A human species erythropoietin coding DNA sequence according to point 5. 

7. A DNA sequence according to point 6 and including the protein coding region set forth in Table VI. 

8. A DNA sequence according to point 1 or 2, covalently associated with a detectable label substance. 

9. A DNA sequence according to point 8, wherein the detectable label is a radiolabel. 

10. A single-strand DNA sequence according to point 8 or 9. 

11. A DNA sequence according to point 1. coding for [Phei6]hEPO. [Phe-^^lhEPO, [Phei^SihEPO [His^lhEPO 
[Asn2 des-Pro2 through lle^JhEPO, [des-Thriej through ArgieejhEPO. orfA27-55]hEPO. 

1 2. A procaryotic or eucaryotic host cell transformed or transfected with a DNA sequence according to any one of 
points 1, 2, 3, 6. 7 and 8, in a manner allowing the host cell to express said polypeptide product. 

iolyj JS""'"'^'* °' transfected host cell according to point 12 which host cell Is capable of glycosylating said 

14. A transformed or transfected mammalian host cell according to point 1 3. 

1 5. A transformed or transfected COS cell according to point 1 3. 

16. A transformed or transfected CHO cell according to point 1 3. 

i^pSnts s'^S ^ ^^^"^"^ ^'"^^ '"'""'^'"^ ^ sequence according to any one 

1 8. A procaryotic or eucaryotic host cell stably transformed or transfected with a DNA vector according to point 1 7. 

19. A recombinant polypeptide having part or all of the primary structural conformation of human or monkey eryth- 
ropoietin as set forth in Table VI or Table V or any allelic varent or derivative thereof possessing the bioloJcal 
property of causing bone marrow cells to increase production of retkiulocytes and red blood cells to Increase 
hemoglobin synthesis or iron uptake and characterized by being the product of eucaryotic expresston of an exog- 
enous DNA sequence and which has higher molecular weight by SDS-PAGE from eiythropoletln isolated from 
urinary sources. / r- 

20. A glycoprotein polypeptide according to point 19 having an average carbohydrate compositk)n whk:h differs 
from that of human erythropoietin isolated from urinary sources. 

21 . A polypeptide according to point 1 9 or 20 wherein the exogenous sequence is a cDNA sequence. 
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22.Apolypeptideaccortin9 to point 19or 20 whereh the exogenous DNA sequence isagen^^ 
fepLTS^^^^^^^^^^ 

25. A polypeptide according to point 24. wherein said detectable label is a radiolabel. 
f2T5'!^tdT*^'"'°"''''''"^^^ 

e™p.,pe,.e;andopti.a?.^ 

28. A process according to point 27, characterized by culturing a host cell of any one of points 1 2 to 16. 

29. A process according to point 27 or 28 for production of a polypeptide of any one of points 1 9 to 23 and 26. 

2^csrrpr:=^^^^^^^ 

S^A pharmaceutical compositcn according to point 30. comprising a polypeptide of any one of points 1 9 to 23 
of their grovWh^n'sx^ess SruTrfeSfn i^^^^^^ 

erythropoietin per ,0S celte in 48 h^urs afdttZr^b^^SLTunrssr ' °' ' ""'^ '"'^"^ °' 

sequre^^crraror^^^^^^^^^ 
rx^otrm-™ 

;r:rmix^Cer--~^ 

poietins o, monkey'andTuman Xirs 5ns ' "'"^'^ °' 
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Z thln^fn, .2. w l"^ °' °' °' """'^ ^'^''^ (i.e.. deletion anatogs con Jning 

!11h k k L"^""^ ^^'"^^ '°' ^"'^^^ substttutton analogs wherein one or more residues sp^S^e 

M^inST P*"yP«P«^^): ^'f'^h Share sorr,e or all the properties of natural^Kxcurring fols 

an^VI Srolm'^ f erythropoietin which are comprehended by: (a) the DNA sequences set out In Tables V 
^.1 . r ?1I ! '=.°^P'""°"'«'y ^•«n<ls: C) DMA seqeuences which hybridize (under hybridization condKns 
such asillustrated here.n or more stringent conditions) to DNA sequences defined in (a) or f ragmS s'hereor^TS 

(eg^'Sotb^eMh'i'Cor^^^^^ 

(b) the sample is fixed to a solid substrate 
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cJiep^^Tm!!!SNT!ZZ^^ constitute me first known instance of the use of mu«ip,e n,ixed oligonu- 
f ir^^i^^ZcS ofJe le^ ?m1r i«"atbn of mammalian genomic clones and the 

NZTrlnt ^ !. °* "^'^ ""^^ 32 Oligonucleotide probes in the isolation of cDNA clones 



Detailed Description 



and mSrspecS: e7h:^>o:r!:e;^^^^^^^^^^ ^ST'T ^" °' '"'^"^''''^^ sequence o, human 

monkey andhumanorigin DNA Srbeen r^^^^^^^^ f ' ^^^^ '^'^ ^"^ characterized. Further, the 
quantities of poVpcSd^^tevfnQ bbl^^^^ 

^th in ./.oaX ribS a^^^^^^ — logK^I) properties of naturalV^curring EPO as well as 

tissur:faLiT5:;?rhS;?di3^^^^^ 

high levels of EPO cornSrad^o n^^lr^^ immunologicalV determined to include 

ending oSll^Z^ed h^^^^^^^^^ ^•^'^^ ^-^'"'"9 

olabelled, 20-mer oligonucleotide probes and involvprf\ha t h ^ hybridization employing a pool of 128 mixed, radi- 
cleot^e probes was LeS o^ SiSLtcTd^Zcet^^^^^^ screening of 200,000 colonies. Design of ,he oligonu- 
a small sample of human ETO ^ '"formation provided by enzymatic fragmentation and sequencing 

tainlnTEpSi^cSXiTif^^^^^^^ Tnl^r " ^^'^ "^^^ '^^^^^ °' '"-es con- 

poo.^ft2amixed2^mro,SL:3^^^^^^^ 

"TirLrernrpi^^^^^^^^ 

sequences within the pooi S S'm*rp,^b21nd sTr^oH ? ^^^^'^omg of clonal DNA using a subset of 16 

resuhing in deduction ^prl^Ttl'^^con^^^^ '° "t^'""' "^"'^^ 

polypeptide sequences displayed a hioh XoTJ nf . , Polypeptides encoded thereby. The deduced 

amino acid ana'iysis of hS EPO fJ^gmems '° ""^ ^ 9«"«^a'e'' "y 

growth Of t,ansfectedho;^ceT^r^Xd^^ultu^^lSIT ''^'° '^^"^ Cultured 

3000 mu of EPO per m^f clrTfluTd supernatant preparations estimated to contain as much as 

cure^rxi^rd:^^^^^^^ 

dures resulting In such identification anH lo ,. ^"^""9 "loni^ey cDNA clones and human genomic clones, to proce- 
verification of Ipo exp^i^^^^^^^^^ ''^^^'^P-"' °' -P--- ^V^t-s and immunological 

m JrZ fS^^'^fiL' ^^^^^^^ °' '-9-n«s and constn.ct.,n of 
involved in the identifiSE^ orpS;?em<^k^^^^^^^^ 

ment and preliminary radioimmS^SsiwRlATan^^l ? information concerning animal treat- 

cDNA library, colony hybSion^r^i^ rin^^^^^ I T "^"^ ^"^"""^ ^ '° Preparation of the 

Clone and the genltiSTrSntey ETO '''''' '^"^"^'"^ °' ^ P^^''^^^ ^^NA 
mation. Example 4 is 67oct^p^2^^ZT7 T"'"'^' conformation (amino acid sequence) hfor- 

provides Infon^ation conSmJg tJ^s^^^orre o^^^^ °' "^'"'"^ genomic clones and thus 
positive clones. Example sr^Jrec^eTtrDNAlour^^ 

EPO polypeptide amino acid s^ncl info^maZ^ ^ ^^"'"^ 9«"«'a«°" °' 

information. Example 6te ^ ecSJto Drir<^S TsT^^ ? ^ f^Panson thereof to the monkey EPO sequence 

from a positive monkercDS2,^^'ruse^^ 

transfected cells. ExarJple 7 is ^e^iS to orc^' ^^^^^^^^ °' ^"""'^ 9^°^^ °' •►'^ 

derived from a positive Euri^ dcSifthe L^^^^^^^^ °' ^ ^^"^'^^ incorporating EPO^ncoding DNA 

growth of the tra^fected cT ixaS eSrSi'j; ,o?^^^^ "^'"^ '^"i'^''^" °' <^e"« ^"^ '^-a '="«"rad 
obtained from the cultured gro«Jh o7JansfI:2r«?. ""7"°^"^^^ procedures perfomied on media supematants 

and/nv^obiok^icalactMjr.ra;X^^^^^ 

andhptpresrrD.^^^^^^^ 

.09.ca,act.n.so,produc.so,these expression systems as weTa^iha^^^^^^ 



11 



EP 0 148 605 B2 



directed to the preparatlcxi of manufactured genes encoding human species EPO and EPO analogs, which genes 
Include a number of preference condons for expression in E. cofi and yeast host cells, and to expression systems 
based thereon. Example 12 relates to the Immunological and biological activity profiles of expression products of the 
systems of Example II. 

Example 1 

A. Human EPO Fragment Amino Acid Sequencing 

Human EPO was Isolated from urine and subjected to tryptic digestion resulting in the development and isolation 
of 17 discrete fragments in quantities approximating 100 — 150 picomoles. 

Fragments were arbitrarily assigned numbers and were analyzed for amino acid sequence by microsequence 
analysis using a gas phase sequencer (Applied Blosystems) to provide the sequence information set out in Table I, 
below, wherein single letter codes are employed and "X" designates a residue which was not unambiguously deter- 
mined. 



TABLE I 



Fraqmer^t No, Sequence Analysis Result 

A.P-P-R 
G-K-L-K 
A-L-G-A-Q-K 
V-L-E-R 
A-V-S-G-L-R 
L-F-R 
K-L-F-R 
Y-L-L-E-A-K 
L-I-C-O-S-R 
L-Y-T-G-E-A-C-R 
T-I-T-A-D-T-F.R 
E-A-I-S-P-P-O-A-A-M-A-A-P-L-R 
E-A-E-X-I-T-T-G-X-A-E-H-X-S'-L- 

N-E-X-I-T-V-P 
V-Y-S-N-F-L-R 
S-L-T-r-L-L-R 
V-N-F-y-A-W-K 
G-Q-A-L-L-V-X-S-S-Q-P-W^ 
E-P-L-Q-L-H-V-D-K 



B. Design and Construction of Oligonucleotide Probe Mixtures 

The amino acid sequences set out in Table I were reviewed in the context of the degeneracy of the genetic code 
for the purpose of ascertaining whether mixed probe procedures could be applied to DN A/DN A hybridizatbn procedures 
on cDN A and/or genomic DNA libraries. This analysis revealed that within Fragment No. T35 there existed a series of 
7 amino acid residues (Val-Asn-Phe-Tyr-Ala-Trp-Lys) which could be uniquely characterized as encoded for by one of 



TAa 

T4b 

T9 

T13 

T16 

T18 

T21 

T25' 

T26a 

T26b 

T27 

T28 

T30 

T31* 
^T33 
T35 
T38 
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128 possible DNA sequences spanning 20 base pairs. A first set ot 128 20-nrier oligonucleotides was therefore syn- 
thesized by standard phosphoamidite methods (See, e.g., Beaucage, at al., Tetrahedron Letters, 22, pp. 1859 — 1862 
(1981) on a solid support according to the sequence set out in Table 11, below. 



5 

TABLE II 





Residue 


- Val 


- Asn 


Phe 


m. 


Ala 






10 


3' 


CAA 
T 


TTG 
A 


AAG 
A 


ATG 
A 


CGA 
T 


ACC 


TT - 5* 



G 
C 



G 

C. 



75 

Further analysis revealed that within fragment No. T38 there existed a series of 6 amino acid residues (Gln-Pro- 
Trp-Glu-Pro-Leu) on the basts of which there could be prepared a pool of 128 mixed oligonucleotide 17-mer probes 



as set out In Table III below. 



20 



30 



TABLE III 



Residue - Gin Pro Trp Glu Pro Leu 

3' GTT GGA ACC CTT GGA GA - 5* 
C T C T A 

G G 
C C 



Oligonucleotide probes were labelled at the 5' end with gamma — 32p_ATP, 7500—8000 Ci/mmole (ION) using 
T. polynucleotide kinase (NEN). 



Example 2 

35 A. Monkey Treatment Procedures and RIA Analysis 



Female Cynomolgus monkeys Macaca fascicufarias (2.5—3 kg, 1.5— 2 years old) were treated subcutaneou sly 
with a pH 7.0 solution of phenylhydrazine hydrochloride at a dosage level of 12.5 mg/kg on days 1, 3 and 5. The 
hematocrit was monitored prior to each injection. On day 7, or whenever the hematocrit level fell betow 25% of the 
Initial level, serum and kkJneys were harvested after administration of 25 mg/kg doses of ketamine hydrochloride. 
Han^ested materials were immediately frozen in Iquki nitrogen and stored at -70*C. 

8. RIAforEPO 



Radioimmunoassay procedures applied for quantitative detectk>n of EPO in samples were conducted according 
to the following procedures: 

An erythropoietin standard or unknown sample was incubated together with antiserum for two hours at 37'*C. After 
the two hour incubation, the sample tubes were cooled on ice, i25|.|abelied erythropoietin was added, and the tubes 
were incubated at C^C for at least 15 more hours. Each assay tube contained 500 ^il of incubation mixture consisting 
of 50 \ii of diluted immune sera. 10,000 cam of i25|.erythropoietin, 5 ^il trasylol and 0—250 \l\ of either EPO standard 
or unknown sample, with PBS containing 0.1% BSA making up the remaining volume. The antiserum used was the 
second test bleed of a rabbit immunized with a 1 % pure preparation of human urinary erythropoietin. The final antiserum 
dilution on the assay was adjusted so that the antibody-bound ^2S|.Epo dW not exceed 10—20% of the input total 
counts. In general, this corresponded to a final antiserum dilution of from 1 :50,000 to 1 :1 00.000. 

The antibody-bound i25|_erythropoietin was precipitated by the addition of 1 50 ^il Staph A. After a 40 min, incuba- 
tion, the samples were centrif uged and the pellets were washed two times with 0.75 ml 10 mM Tris-HCI pH 8.2 con- 
taining 0.1 5M NaCI. 2mM EDTA. and 0.05% Triton X-100. The washed pellets were counted in a gamma counter to 
determine the percent of i25|-erythropoietin bound. Counts bound by pre-immune sera were subtracted from all final 
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values to correct for nonspecific precipitation. The erythropoietin content of the unknown samples was determined by 
comparison to the standard curve. 

The above procedure was applied to monkey serum obtained in Part A. above, as well as to the untreated monkey 
serum. Nomnal serum levels were assayed to contain approximately 36 mil/ml while treated monkey serum contained 
s from 1 000 to 1 700 mU/ml. 

Example 3 

A. Monkey cDNA Library Construction 

10 

Messenger RNA was isolated from normal and anemia monkey kidneys by the guanidinium thiocyanate procedure 
of Chirgwin. at al., Biochemistry, 18, p. 5294 (1 979) and poly (A)+ mRNA was purified by two runs of oligo(dT)-ceullose 
column chromatography, as described at pp. 197-198 in Maniatis, al al., "Molecular Cloning, A Laboratory Manual" 
(Cold Springs Harbor Laboratory, Cold Springs, Harbor, N.Y.. 1982). The cDNA library was constructed according to 

'5 a modification of the general procedures of Okayama. at al., Mol and Ceii. Biol, 2, pp. 161—170 (1982). The key 
features of the presently preferred procedures were as follows: (1) pUC8 was used as the sole vector, cut with PsW 
and then tailed with oligo dT of 60--BO bases In length; (2) H/ndl digestion was used to remove the oligo dT tail from 
one end of the vector; (3) first strand synthesis and oligo dG tailing was carried out according to the published procedure; 
(4) fiamHI digestion was employed to remove the oligo dG tail from one end of the vector; and (5) replacement of the 

20 RNA strand by DNA was In the presence of two linkers 



30 



55 



(GATCTAAAGACCGTCCCCCCCCC and ACGGTCTTTA) 

25 In a three-fold molar excess over the oligo dG tailed vector. 

B. Colony Hybridization Procedures For Screening Monkey cDNA Library 

Transformed E. co//were spread out at a density of 9000 cok>nies per 1 0 x 1 0 cm plate on nutrient plates contain ing 
50 micrograms/ml Ampicillin. GeneScreen filters (New England Nuclear Catalog No. NEF-972) were pre-wet on a 
BHI— CAM plate (Bacto brain heart infusion 37 g/L. Casamino acids 2 g/L and agar 1 5 g/L, containing 500 micrograms/ 
ml Chloramphenicol) and were used to lift the colonies off the plate. The colonies were grown In the same medium for 
1 2 hours or longer to amplify the plasmid copy numbers. The amplified colonies (cotony side up) were treated by serially 
placing the filters over 2 pieces of Whatman 3 MM paper saturated with each of the foltowing solutions: 

(1) 50 mM glucose— 25 mM Tris-HCI (ph 8.0)^10 mM EDTA (pH 8.0) for five minutes; 

(2) 0.5 M NaOH for ten minutes; and 

(3) 1 .0 M Tris-HCI (pH 7.5) for three minutes. 

<o The filters were then air dried in a vacuum over at 80*C for two hours. 

The filters were then subjected to Proteinase K digestion through treatment with a solution containing 50 micro- 
grams/ml of the protease enzyme in Buffer K (0.1M Tris-HCI (pH 8.0)-~O.15M NaCI— 10 mM EDTA (pH 8.2) 0.2% 

SDS). Specifically 5 ml of the solution was added to each filter and the digestbn was allowed to proceed at 55'C for 
30 minutes, after which the solution was removed. 

The filters were then treated wtih 4 ml of a prehybridization buffer (5 X SSPE— 0.5% SDS— 100 microgramms/ 
ml SS E. CO// DNA— 5 X BFP). The prehybrWIzatlon treatment was carried out at 55»C, generally for 4 hours or longer, 
after which the prehybridization buffer was removed. 

The hybridlzatton process was carried out In the following manner. To each filter was added 3 ml of hybridization 
buffer (5 X SSPE— 0.5% SDS— 100 micrograms/ml yeast tRNA) containing 0.025 picomotes of each of the 128 probe 
sequences of Table II (the total mixture being designated the EPV mixture) and the filters were maintained at 48'»C for 
20 hours. This temperature was TO less than the lowest of the calculated dissociation temperatures (Td) determined 
for any of the probes. 

Foltowing hybrkJization, the filters were washed three times for ten minutes on a shaker with 6 x SSC— 0. 1 % SDS 
at room temperature and washed two to three times with 6 x SSC-1% SDS at the hybridizatbn temperature (48»C). 
Autoradiography of the filters revealed seven positive clones among the 200.000 colonies screened. 
Initial sequence analysis of one of the putative monkey cDNA clones (designated clone 83) was performed for 
verificatlonpurposesbyamodificationof the procedure of Wallace, at al.. Gene, 76, pp. 21—26(1981). Briefly, plasmid 
DNA from monkey cDNA ctone 83 was linearized by digestion with EcoRI and denatured by heating in a boiling water 
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bath. The nucleotide sequence was determined by the dideoxy method of Sanger, at al., RN.A.S. (U.S.A.) 74, pp. 
5463 — 5467 (1 977). A subset of the EPV mixture of probes consisting of 16 sequences was used as a primer for the 
sequencing reactions. 

s C. Monkey EPO cDNA Sequencing 

Nucleotide sequence analysis of clone 83 was carried out by the procedures of Messing, Methods in Enzymology, 
101, pp. 20 — 78 (1983). Set out in Table IV is a preliminary restriction map analysis of the approximately 1600 base 
pair EcoRi/HindiW cloned fragment of clone 83. Approximate locations of restriction endo-nuclease enzyme recognition 
10 sites are provided in ternns of number of bases 3' to the EcoFR site at the 5' end of the fragment. Nucleotide sequencing 
was carried out by sequencing Individual restriction fragments with the intent of matching overlapping fragments. For 
example, an overlap of sequence information provided by analysis of nucleotides in a restriction fragment designated 
C1 13 (Sau3A at ~111/Smal at ~324) and the reverse order sequencing of a fragment designated C73 {AliA at 
'-424/BstB\ at -203). 

75 

TABLE iV 
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25 
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Restriction Enzyme 


rtecognnion one 


Approximate Location(s) 


EcoRI 


1 


Sau3A 


111 


Smal 


180 


BstEII 


203 


Smal 


324 


Kpni 


371 


Rsal 


372 


Alul 


424 


PstI 


426 


Alul 


430 


Hpal 


466 


Alul 


546 


PstI 


601 


Pvull 


604 


Alul 


605 


Alul 


782 


Alul 


788 


Rsal 


792 


PstI 


807 


Alul 


841 


Alul 


927 


Ncol 


946 


Sau3A 


1014 


Alul 


1072 


Alul 


1115 


Alul 


. 1223 


PstI 


1301 


Rsal 


1343 


Alul 


1384 


Hindlll 


1449 


Alul 


1450 


Hindlll 


1585 



Sequencing of approximately 1 342 base pairs (within the region spanning the Sau3A site 3" to the Ecot\ site and 
the H/r»cflll site) and analysis of all possible reading frames has albwed for the development of DNA and amino ackJ 
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sequence information set out in Table V. In the Table, the putative initial amino acid residue of the amino terminal of 
mature EPO (as verified by correlation to the previously mentbned sequence analysis of twenty amino terminal resi- 
dues) is designated by the numeral +1. The presence of a methionine-specifying ATG codon (designated -27) "up- 
stream*) of the initial amino temfiinal alanine residue as the first residue designated for the amino acid sequence of 
the mature protein is indicative of the likelihood that EPO is initially expressed in the cytoplasm in a precursor form 
including a 27 amino acid "leader" region which is excised prior to entry of mature EPO into circulation. Potential 
glycosylation sites within the polypeptide are designated by asterisks. The estimated molecular weight of the translated 
region was determined to be 21 .117 daltons and the M.W. of the 165 residues of the polypeptide constituting mature 
monkey EPO was determined to be 18,236 daltons. 
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TABLE V 



Translation of Monkey EPO cDNA 



Sau3A 

gaTcccgcgccccctggacagccgccctctcctccaggcccgtggggctggccctgccc 
cgctgaacttcccgggatgaggactcccggtgtggtcaccgcgcgcctaggtcgctgag 



-27 -20 
Met Gly Val His Glu Cys Pro Ala Trp 
GGACCCCGGCCAGGCGCGGAGATG GGG GTG CAC GAA TGT CCT GCC TGG 

-10 

Leu Trp Leu Leu Leu Ser Leu Val Ser Leu Pro Leu Gly Leu Pro 
CTC TGG CTT CTC CTG TCT CTC GTG TCG CTC CCT CTG GGC CTC CCA 

-1 +1 10 
Val Pro Gly Ala Pro Pro Arg Leu He Cys Asp Ser Arg Val Leu 
GTC CCG GGC GCC CCA CCA CGC CTC ATC TGT GAC AGC CGA GTC CTG 

20 ♦ 
Glu Arg Tyr Leu Leu Glu Ala Lys Glu Ala Glu Asn Val Thr Met 
GAG AGG TAC CTC TTG GAG GCC AAG GAG GCC GAG AAT GTC ACG ATG 

30 ♦ 40 

Gly Cys Ser Glu Ser Cys Ser Leu Asn Glu Asn He Thr Val Pro 
GGC TGT TCC GAA AGC TGC AGC TTG AAT GAG AAT ATC ACC GTC CCA 



Asp Thr Lys Val Asn Phe Tyr 
GAC ACC AAA GTT AAC TTC TAT 



50' 

Ala Trp Lys Arg Met Glu Val Gly 
GCC TGG AAG AGG ATG GAG GTC GGG 



60 70 
Gin Gin Ala Val Glu Val Trp Gin Gly Leu Ala Leu Leu Ser Glu 
CAG CAG GCT GTA GAA QTC TGG CAG GGC CTC GCC CTG CTC TCA GAA 

80 ♦ 

Ala Val Leu Arg Gly Gin Ala Val Leu Ala Asn Ser Ser Gin Pro 
GCT GTC CTG CGG GGC CAG GCC GTG TTG GCC AAC TCT TCC CAG CCT 

90 100 
Phe Glu Pro Leu Gin Leu His Met Asp Lys Ala He Ser Gly L^u 
TTC GAG CCC CTG CAG CTG CAC ATG GAT AAA GCC ATC ACT GGC CTT 

110 

Arg Ser He Thr Thr Leu Leu Arg Ala Leu Gly Ala Gin Glu Ala 
CGC AGC ATC ACC ACT CTG CTT CGG GCG CTG GGA GCC CAG GAA GCC 
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TABLE V (continutedl 



120 



lie Ser Leu Pro Asp Ala Ala Ser Ala nia p,.« i « . P° 
ATC TCC CTC CCA CAT rrr rrr rrJ 2i5 He 
AIL ICC CTC CCA GAT GCG GCC ICG GOT GCT CCA CTC CGA ACC ATC 



140 

Thr Ala Asp Thr Phe Cys Lvs Leu Phi* At-« u^i ^ ^ ^ 
ACT GCT GA? ACT TTC T^C .11 t5c" TtI clt g'?J ]ll !j? 

160 

Leu Arg Gly Lys Leu Lys Leu Tyr Thr Gly Glu Ala Cvs Am am 
CTC CGG. G6A AAG CTG AAG CTG TAC ACG CM MG Gcl ill fit III 

165 

Gly Asp Arg OP 

GGG GAC AGA TGA CCAGGTGCGTCCAGCTGGGCACATCCACCACCTCCCTCACCAACA 
CTGCCTGTGCCACACCCTCCCTCACCACTCCCGAACCCCATCGAGGGGCTCTCAGCTAAG 

CGCCAGCCTGTCCCATGGACACTCCAGTGCCAGCAATGACATCTCAGGGGCCAGAGGAAC 
TGTCCAGAGCACAACTCTGAGATCTAAGGATGTCGCAGGGCCAACTTGAGGGCCCAGAGC 
AGGAAGCATTCAGAGAGCAGCTTTAAACTCAGGAGCAGAGACAATGCAGGGAAAACACCT 
GAGCTCACTCGGCCACCTGCAAAATTTGATGCAGGACACGCTTTGGAGGCAATTTACCTG 
TTTTTGCACCTACCATCAGGGACAGGATGACTGGAGAACTTAGGTGGCAAGCTGTGACTT 
CTCAAGGCCTCACGGGCACTCCCTTGGTGGCAAGAGCCCCCTTGACACTGAGAGAATATT 
TTGCAATCTGCAGCAGGAAAAATTACGGACAGGTTTTGGAGGTTGGAGGGTACTTGACAG 
6TGTGTGGGGAAGCAG6GCGGTAGGGGTGGAGCT6GGATGC6AGTGAGAACCGTGAAGAC 

AGGATGGGGGCTGGCCTCTGCTTCTCGTGGCGTCCAAGCTT 

Hindlll 

The polypeptide sequence of Table V may readily be subjected to analysis for the presence of highly hydrophilic 
regions and/or secondary confornfational characteristics indicative of potentially highly Immunogenic regions by e g 
n^'"T,^fooor°^.^' ^-iu'- 3824-3828 (1981) and Kyte at al., JMoLBiol.. 157. pp. 

' • '3. pp. 222-245 {^97 4) and Advances in Enzymotogy, 47,pp 45-47 

(1978). Computer-assisted analysis according to the Hopp, at al. method is available by means of a program designated 
PEP Reference Section 6.7 made available by Intelligenetlcs. Inc.. 124 University Avenue. Pato Alto. California. 

Example 4 

A. Human Genomic Library 

^ ^^^^ Phage-bome human fetal liver genomic library prepared according to the procedures of Lawn, at al Cell 
IB. pp. 533—543 (1979) was obtained and maintained for use in a plaque hybrWizatlon assay 
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B. Plaque Hybridization Procedures For Screening Hunian Genomic Library 

Phage particles were lysed and the DNAs were fixed on filters (50,000 plaques per filter) according to the proce- 
dures of Woo. Methods In Enzymohgy, 68, pp. 389—395 (1979) except for the use of GeneScreen Plus filters (New 
England Nuclear Catalog No. NEF^976) and NZYAM plates (NaCI. 5 g; MgClg— SHgO, 2 g; NZ-Amine A. 10 g; yeast 
extract, 5 g; casamino acids, 2 g; ruaftose; 2 g; and agar, 15 g per liter). 

The air-dried filters were baked at 80»C for 1 hour and then digested with Proteinase K as described in Example 
3, Part B. Prehybridization was carried out with a 1M NaCI— 1% SDS buffer for 55»C for 4 hours or more, after which 
the buffer was removed. Hybridizatbn and post-hybridization washings were carried out as described in Example 3, 
Part B. Both the mixture of 128 20-mer probes designated EPV and the mixture of 128 17-mer probes of Table III 
(designated the EPQ mixture) were employed. Hybridization was carried out at 48X using the EPV probe mixture 
EPQ probe mixture hybridization was carried out at 46X~4 degrees below the lowest calculated Td for members of 
the mixture. Removal of the hybridized probe for rehybrldlzation was accomplished by boiling with 1 X SSC — 0 1 % 
SDS for two minutes. Autoradiography of the filters revealed three positive clones (reactive with both probe mixtures) 
among the 1 .500,000 phage plaques screened. Verification of the positive clones as being EPO-encoding was obtained 
through DNA sequencing and electron micrographic visualization of heteroduplex formation with the monkey cDNA of 
Example 3. This procedure also gave evidence of multiple introns In the genomic DNA sequence. 

Example 5 



Nucleotkie sequence analysis of one of the positive ctones (designated AhEl) was carried out and results obtained 
to date are set out in Table VI. 
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In Table VI. the initial continuous DNA sequence designates a top stranci of 620 bases in what Is apparently an 
untranslated sequence immediately preceding a translated portion of the human EPO gene. More specrfically the 
sequence appears to comprise the 5* end of the gene which leads up to a translated DNA region coding for the first 
four ammo acids (-27 through >24) of a leader sequence (-presequence"). Four base pairs in the sequence prior to that 
encoding the beginning of the.leader have not yet been unambiguously detemilned and are therefore designated by 
an X • There then follows an intron of about 639 base pairs (439 base pairs of which have been sequenced and the 
remaining 200 base pairs of which are designated -I.S/) and immediately preceding a codon for glutamine which has 
been designated as residue >23 of the translated polypeptide. The exon sequence immediately following is seen to 
code for ammo acid residues through an alanine residue (designated as the +1 residue of the amino acid sequence 
of mature human EPO) to the codon specifying threonine at position +26. whereupon there follows a second intron 
consisting of 256 bases as specifically designated. Foltowing this intron is an exon sequence for amino acid residues 
^7 through 55 and thereafter a third intron comprising 612 base pairs commences. The subsequent exon codes for 
residues 56 through 115 of human EPO and there then commences a fourth intron of 1 34 bases as specified Following 
the fourth intron Is an exon coding for residue Nos. 116 through 1 66 and a 'stop- codon (TGA). Finally. Table VI identifies 
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a sequence of 568 base pairs in what appears to be an untranslated 3" region of the human EPO gene, two base pairs 
of which ("X*) have not yet been unambiguously sequenced. 

Table VI thus senses to identify the primary structural conformation (amino acid sequence) of mature human EPO 
as including 166 specified amino acid residues (estimated M.W. = 18.399). Also revealed in the Table is the DNA 
sequence coding for a 27 residue leader sequence along with 5' and 3' DNA sequences which may be significant to 
promoter/operator functions of the human gene operon. Sites for potential glycosylatidn of the mature human EPO 
polypeptide are designated in the Table by asterisks. It is worthy of note that the specific amino acid sequence of Table 
VI likely constitutes that of a naturally occurring allelic form of human erythropoietin. Support for this posHlon is found 
in the results of continued efforts at sequencing of urinary isolates of human erythropoietin which provided the finding 
that a significant number of erythropoietin molecules therein have a methionine at residue 1 26 as opposed to a serine 
as shown In the Table. 

Table VII. below, illustrates the extent of polypeptide sequence homotogy between human and monkey EPO In 
the upper continuous line of the Table, single letter designations are emptoyed to represent the deduced translated 
pohrpeptide sequences of human EPO commencing with residue -27 and the lower continuous line shows the deduced 
jjohrpeptide sequence of monkey EPO commencing at assigned residue number -27. Asterisks are emptoyed to high- 
light the sequence homotogies. It should be noted that the deduced human and monkey EPO sequences reveal an 
additional- lysine (K) residue at (human) position 116. Cross-reference to Table VI indtoates that this residue is at the 
margin of a putative mRNA splice junction In the genomic sequence. Presence of the lysine reskJue in the human 
polypeptide sequence was further verified by sequencing of a cDNA human sequence clone prepared from mRNA 
isolated from COSP— 1 cells transfored with the human genomic DNA in Example 7 infm 
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Example 6 



55 



tiH« ^"^'"P'^ °' 'sol^'able quantities of EPO polypep- 

h^t ceils (i.e.. COS-1 cells. A.T.C.C. No. CRL-1650). The cells were transfected with a 'shuttle' vector capable of 

fhv v^r^^fr" ^''y ""^"^ °' •'^^ f"^^ pBR322-derived DNA) and the mammalian h^^ 

(by virtue ot the presence of SV40 virus-derived DNA). "«mmaiian nosis 

More spectflcally. an expression vector was constructed according to the following procedures. The plasmid clone 
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83 provided in Example 3 was amplified in E.co//and the approximately 1 .4 kb monkey EPO-encoding DN A was isolated 
by EcoRI and HinM digestion. Separately isolated was an approximately 4.0 kb. Hindi\\/Sal\ fragment from pBR322. 
An approximately 30 bp, EcoRi/Sal\ "linker" fragment was obtained from M13mp10 RF DNA (P and L Laboralorles). 
This linker included, in series, an EcoRI sticky end. followed by Ssfl.Smal. BamHI and Xba\ recognitton sites and a 
Sail sticky end. The above three fragments were ligated to provide an approximately 5.4 kb intermediate plasmid 
("pERS") wherein the EPO DNA was flanked on one side by a "bank" of useful restriction endonuclease recognition 
sites. pERS was then digested with HincAW and Sah to yield the EPO DNA and the EcoRI to Sail (M13mp10) linker. 
The 1 .4 kb fragment was ligated with an approximately 4.0 kb BamH\/Sali of pBR322 and another M13mp10 Hind{\\/ 
BarriH\ RF fragment linker also having approxinnately 30 bp. The M13 linker fragment was characterized by a HindW 
stteky end, followed by f^sfl, Sah, Xba\ recognition sites and a BamI sticky end. The ligation product was, again, a 
useful intermediate plasmid ("pBR— EPO") including the EPO DNA flanked on both sides by banks of restriction site. 

The vector chosen for expression of the EPO DNA In COS~1 cells ("pDSVLI ") had previously been constructed 
to altow for selection and autonomous replication in E.coli. These characteristics are provided by the origin of replication 
and Ampicillin resistance gene DNA sequences present in the region spanning nucleotides 2448 through 4362 of 
pBR322. This sequence was structurally modified by the addition of a linker providing a W/noflll recognition immediately 
adjacent nucleotide 2448 prior to incorporation into the vector. Among the selected vector's other useful properties 
was the capacity to autonomously replicate in COS— 1 cells and the presence of a viral promoter sequence functional 
in mammalian cells. These characteristics are provided by the origin of replication DNA sequence and "late gene" viral 
promoter DNA sequence present in the 342 bp sequence spanning nucleotide numbers 51 71 through 270 of the SV40 
genome. A unique restrictbn site (BanihW) was provided in the vector and immediately adjacent the viral promoter 
sequence through use of a commercially available linker sequence (Collaborative Research). Also incorporated in the 
vector was a 237 base pair sequence (derived as nucleotide numbers 2553 through 2770 of SV40) containing the "late 
gene- viral mRNA polyadenylation signal (commonly referred to as a transcription terminator). This fragment was po- 
sitioned in the vector in the proper orientation vis-a-vis the "late gene" viral promoter via the unique BarriHi site. Also 
present in the vector was another mammalian gene at a kjcation not material to potential transcription of a gene inserted 
at the unique BanHl site, between the viral promoter and terminator sequences. [The mammalian gene comprised an 
approximately 2,500 bp mouse dihydrofolate reductase (DHFR) minigene isolated from plasmid pMG-1 as in Gasser, 
at al.. RN.AS. (U.S.A), 79, pp. 6522—6526. (1982).] Again, the major operative components of plasmid pDSVLI 
comprise nucleotides 2448 through 4362 of pBR322 along with nucleotides 5171 through 270 (342 bp) and 2553 
through 2770 (237 bp) of SV40 DNA. 

Following procedures described, e.g.. in Maniatis. at al., supra, the EPO-encoding DNA was isolated from plasmid 
pBR— EPO as a BamHI fragment and ligated into plasmid pDSVLI cut with BamHI. Restriction enzyme analysis was 
employed to confirm insertion of the EPO gene in the correct orientation in two of the resulting cloned vectors (duplfcate 
vectors H and L). See Figure 2. illustrating plasmid pDSVL— MkE. Vectors with EPO genes in the wrong orientation 
were saved for use as negative controls in transf action experiments designed to determine EPO expression levels in 
hosts transformed with vectors having EPO DNA in the corect orientation. 

Vectors H. L, F, X and G were combined with carrier DNA (mouse liver and spleen DNA) were employed to transf ect 
duplicate 60 mm plates by calcium phosphate microprecipitate methods. Duplicate 60 mm plates were also transfected 
with carrier DNA as a "mock" transformation negative control. After five days all culture media were tested for the 
presence of polypeptides possessing the immunological properties of naturally-occurring EPO. 

Example 7 

A. Initial EPO Expresston System Involving COS— 1 Cells 

The system selected for initial attempts at microbial synthesis of isotatable quantities of human EPO polypeptide 
material coded for by the human genomic DNA EPO clone, also involved expression in mammalian host cells (i.e., 
COS— 1 cells. A.TC.C. No. ORL— 1650). The human EPO gene was first sub-cloned into a "shuttle" vector which is 
capable of autonomous replication in both E.co// hosts (by virtue of the presence of pBR322 derived DNA) and in the 
mammalian cell line COS— 1 (by virtue of the presence of SV40 virus derived DNA). The shuttle vector, containing the 
EPO gene, was then transfected into COS— 1 cells. EPO polypeptide material was produced in the transfected cells 
and secreted into the cell culture media. 

f^ore specifically, an expressbn vector was constnjcted according to the following procedures. DNA Isolated from 
lambda clone AhEl, containing the human genomic EPO gene, was digested with BamHI and H/ndlll restriction endo- 
nucleases, and a 5.6 Kb DNA fragment known to contain the entire EPO gene was isolated. This fragment was mixed 
and ligated with the bacterial plasmid pUC8 (Bethesda Research Laboratories. Inc.) which had been similarly digested, 
creating the intermediate plasmid "pUCS— HuE", providing a convenient source of this restriction fragment. 

The vector chosen for expression of the EPO DNA In COS— 1 cells (pSV4SEt) had previously been constructed 
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Plasmid pSV4SEt contained DNA sequences allowing selection and autonomous replication in E.coH. These charac- 
teristics are provided by the origin of replication and Ampiciliin resistance gene DNA sequences present In the region 
spanning nucleotides 2448 through 4362 of the bacterial plasmid pBR322. This sequence was structurally modified 
by the addition of a linker providing a HindiW recognition site Immediately adjacent to nucleotide 2448. Plasmid pSV4SEt 
was also capable of autonomous replicatbn in COS — 1 cells. This characteristic was provided by a 342 bp fragment 
containing the the SV40 virus origin of replication (nucleotide numbers 5171 through 270). This fragment has been 
modified by the addition of a linker providing an EcoRI recognition site adjacent to nucleotide 270 and a linker providing 
a Sa/l recognition site adjacent nucleotide 5171 . A 1061 bp fragment of SV40 was also present in this vector (nucleotide 
numbers 1711 through 2772 plus a linker providing a Sail recognition site next to nucleotide number 2772). Within this 
fragment was an unkjue Sa/nHI recognition sequence. In summary, plasmki pSV4SEt contained unique BarriH\ and 
HindiW recognition sites, allowing insertion of the human EPO gene, sequences allowing replteation and selection in 
E.coti, and sequences allowing replication in COS — 1 cells. 

In order to insert the EPO gene into pSV4SEt. plasmid pUC8— HuE was digested with BannHl and Hindi II restriction 
endonucleases and the 5.6 kb EPO encoding DNA fragment isolated. pSV4SEt was also digested with BamHI and 
Hindu and the major 2513 bp fragment isolated (preserving all necessary functk)ns). These fragments were mixed 
and llgated, creating the final vector "pSVgHuEPO". (See. Figure 3.) This vector was propagated in Eco// and vector 
DNA isolated. Restrictkxi enzyme analysis was employed to confirm insertion of the EPO gene. 

Plasmid pSVgHuEPO DNA was used to express human EPO polypeptide material in COS — 1 cells. More specif- 
ically. pSVgHuEPO DNA was combined with carrier DNA and transfected into triplicate 60 mm plates of COSP-1 cells. 
As a control, carrier DNA alone was also transfected into COS — 1 cells. Cell culture media were sampled five and 
seven days later and tested for the presence of polypeptides possessing the Immunological properties of naturally 
occurring human EPO. 

B. Second EPO Expression System Involving COS — 1 Cells 

Still another system was designed to provide Improved production of human EPO polypeptide nnaterlal coded by 
the human genomic DNA EPO clone in COS — 1 cells (A.T.C.C. No. CRL — 1650). 

In the immediately preceding system, EPO was expressed in COS — 1 cells using its own promoter which is within 
the 5.6 Kb SamHl to HindiW restriction fragment. In the following construction, the EPO gene Is altered to that it is 
expressed using the SV40 late promoter 

More specifically, the cloned 5.6 Kb BamHi to HindiW genomic human EPO restriction fragment was modified by 
the following procedures. Plasmid pUC8 — HuE. as described above, was cleaved with BarrHi and with BstBi restric- 
tion endonucleases. BstBi cleaves within the 5.6 Kb EPO gene at a position which is 44 base pairs 5' to the initiating 
ATG coding for the pre-peptlde and approximately 680 base pairs 3' to the H/ndIM restriction site. The approximately 
4900 base pair fragment was isolated. A synthetic linker DNA fragment, containing Sail and BstBi sticky ends and an 
Internal BarDHI recognition site was synthesized and purified. The two fragments were mixed and llgated with plasmid 
pBR322 v^ich had been cut with Sail and BamHI to produce the intermediate plasmid pBRgHE. The genomic human 
EPO gene can be isolated therefrom as a 4900 base pair BamHl digestion fragment carrying the complete structural 
gene with a single ATG 44 base pairs 3' to Sa/nHI site adjacent the amino terminal coding region. 

This fragment was isolated and inserted as a BaniHi fragment Into BamHi cleaved expression vector plasmid 
pDSVLI (described in Example 6). The resulting plasmid. pSLVgHuEPO. as illustrated In Figure 4, was used to express 
EPO polypeptide material from COS— 1 cells, as described in Examples 6 and 7A. 

Example 8 

Culture media from growth of the six transfected COS — 1 cultures of Example 6 were analyzed by radioimmu- 
noassay according to the procedures set forth in Example 2, Part B. Each sample was assayed at 250. 125, 50. and 
25 microliter aliquot levels. Supernatants from growth of cells mock transfected or transfected with vectors having 
incorrect EPO gene orientation were unambiguously negative for EPO immunoreactivity. For each sample of the two 
supernatants derived from growth of COS — 1 cells transfected with vectors (H and L) having the EPO DNA in the 
correct orientation, the % inhibition of i25| — £pQ binding to antibody ranged from 72 to 88%, which places all values 
at the top of the standard curve. The exact concentration of EPO in the culture supernatant could not then reliably be 
estimated. A quite consen/ative estinrete of 300 mUAnl was made, however, from the value calculation of the largest 
atk^uot size (250 microliter). 

A representative culture fluid according to Example 6 and five and seven day culture fluids obtained according to 
Example 7A were tested in the RIA in order to compare activity of recombinant monkey and human EPO materials to 
a naturally-occurring human EPO standard and the results are set out in graphic form in Figure 1. Briefly, the results 
expectedly revealed that the recombinant monkey EPO significantly competed for anti-human EPO antibody although 
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IITk- " .t '° l°o^'l'^'^ ^""^^ """^^ conditions. The maximum percent inhibition values for re- 

comb«ant human EPO. however, closely approximated those of the human EPO standard. The parallel nature of the 
^B^^luL^^Z^r!'^^' immunological identity of the sequences (epitopes) in common. Prior estimates of mon- 
i^L p . TZ ^""^"^^'^ re-evaluated at these higher dilution levels and were found to range from 2.91 to 3 12 
^d seztT^mffo ,r corresponding^ set at 302 mUAnI for the fiv^y growth sampte 

sion svItT,^ wl l„r''" ' a^^^^.^^-^P'*- Estimated monkey EPO production levels in the ExiTpte 78 expres- 
sion system were on the same order or better. 



Example 9 



,r.^^ "lZl^^ ^^^^^^^ 

to the pracedure Of Goldwasse, at al.. Endocrinology. 97. 2. pp. 315-323 (1975). Estimated monkey EPO vaS 
cufture fluids tested ranged from 3.2 to4.3UAnl.HumanEPO culture fluids were also active in this J^^^^^ 
o E^mofe 6 1? ^ ^"«-EPO antibody The recombinant monkey EPO cu Itu ^ fl^Ss'L^ 

to Example 6 were also subjected to an assay for in vivo biologfcal activity according to the general procedurS rS 

and«;S; r' '''j.^''- ^°«^1067(1961)andHammond,atal...*J.A/.v:/teartSci ug p^s HT^^) 
and activity levels ranged from 0.94 to 1.24 U/ml. w-oio— o^/ jisoo) 



Example 10 



l!f cS?^ , ^ "^'^"'^ '^P'"^'^ '^^"^ '^"^ '° Pf«sence of SV40 T antigen within the cellTnd 

an SV40 origin of replication on the vectors. Though these vectors produce useful quantities of froTc^JSlTceTs 

tJ^atipTbreVC™^^^^ 

CHO DHFR- cells (DuX-BII) CHD K1 cells, Uriaub. at al.. Proc. Nat Acad. Sci. (U.SA ) Vol 77 4461 f1980l 
lack the enzyme dihydrofolate reductase (DHFR) due to mutations in the structural genes and thlfL^u e S 

gr-iubHO (Example 7B) were Iransfected a ona with carrier DNIA into rwn nucQ. ^«ti • • k^^vi_ 

pprf ".""'^ ^"""^ ^"^'"^ ""^'^ the presence of recombinant monkey or human 

cS-^ cLIi 1' T ?T T^^-""'^"^ ^^-^ EPO With ImmuJotogical properties like haTSined fZ 
eS^O.S ulT ' P^^^-^^-^EPO. A representative 65 hour culture fluid con.a^ed monkey 

„oio?i!i!r ""'^ PSVgHuEPO and CHO pDSVL-gHuEPO contained recombinant human EPO with immu- 

n«^og cal properties like that obtained with COS_l cells Iransfected with plasmid pSVgHuEPOrpOm-^HuETO 
A representat^re 3 day culture fluid from CHO pSVgHuEPO contained 2.99 U/^nl of human EPO^d a 5 S^v^nS 
from CHO pDSVL-gHuEPO had 1 8.2 U/ml of human EPO as measured by the Rl A ^ ^ 

n«wl n f EPO produced by the cell strains described above can be increased by gene amplification aivina 

hv^^Thinr !fl ? ""^ "^'"3 methotrexate (MTX). More specifically, cells propagated in media lackina 
trK of S^r ? '? '"'''"'^ '"'^ •'^^ appropriate conditLs. 5g.. r^inTr^al co^cen 

MtT^.! r ' '° ^"'^ '° 9^°^^ "^TX obtained. These cells are fouSd to te resfetenUo 
J^^^ZIT TP'""^'^" °* °' '""'^ f«^"«'"9 increased productbn 0 0?/^^! ° 

^eaterlZ'S mF^RUTe^^ir"' ""'^ '"T concentrattons o, MTX. resuLg in cell strains comailg 
n~.«!^ .r f ? T ^ Passenger genes' (e.g., EPO) carried on the expression vector alonq with the DHFR 
gene or transfomied wrth the DHFR gene are frequently found also to be increased in their gene C dumber. 
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As examples of practice of this amplification system, cell strain CHO pDSVL— MkE was subjected to Increasing 
MTX concentratwns (0 nM. 30 nM and 1 00 nM). Representative 65-hour culture media samples from each amplification 
step were assayed by RIA and detemiined to contain 0.60. 2.45 and 6.10 UAnl. respectively. Cell strain CHO pDSV- 
L— gHuEPO was subjected to a series of increasing MTX concentrations of 30 nM. 50 nM, 100 nM. 200 nM, 1 ^M 
and 5 \iM MTX. A representative 3-day culture media sample from the 100 nM MTX step contained human EPO at 
3089 ± 129 u/ml as judged by RIA. Representative 48 hour cultural medium samples from the 100 nM and 1 uM MTX 
steps contained, respectively, human EPO at 466 and 1352 U/ml as judged by RIA (average of triplicate assays). In 
these procedures, 1 X 10^ cells were plated in 5 ml of media in 60 mm culture dishes. Twenty-four hours later the 
media were removed and replaced with 5 ml of serum-free media (high glucose DMEM supplemented with 0 1 mM 
non-essential ammo acids and L-glutamine). EPO was allowed to accumulate for 48 hours in the serum-free media 
The media was collected for RIA assay and the cells were trypsinized and counted The average RIA values of 467 
U/ml and 1 352 U/ml for cells grown at 1 00 nM and 1 nM MTX. respectively, provided actual yields of 2335 U/plate and 
6750 U/plate. The average cell numbers per plate were 1.94 X 10* and 3.12 X 106 cells, respectively. The effective 
production rates for these culture conditions were thus 1264 and 2167 U/IO^ cells/48 hours 

The cells in the cultures described immediately above are a genetically heterogeneous population Standard 
screening procedures are being employed in an attempt to isolate genetically homogeneous clones with the highest 
production capacity. See. Section A. Part 2. of "Points to Consider in the Characterization of Cell Lines Used to Produce 
Biologies , June 1. 1984. Office of Biologies Research Review. Center for Drugs and Biologies. U.S Food and Dmo 
Administration. ^ 

The productivity of the EPO producing CHO cell lines described above can be improved by appropriate cell culture 
techniques. The propagation of mammalian cells in culture generally requires the presence of serum in the growth 
rnedia. A method for production of erythropoietin from CHO cells in media that does not contain serum greatly facilitates 
the purification of erythropoietin from the culture medium. The method described below is capable of economically 
producing erythropoietin in serum-free media in large quantities sufficient for production. 

Str^n CHO pDSVL-gHuEPO cells, grown in standard cell culture conditions, are used toseed spinner cell culture 
=n f L° Propagated as a suspension cell line in the spinner cell culture flask in media consisting of a 
l^^r^T'^ r!t ^« ^^'^ ^"^^ supplemented with 5% fetal calf serum. L-glutamine. Penicillin 

S Jr'^ 7°'"' non-essential amino acids and the appropriate concentration of methotrexate. Suspension 

cell culture altows the EPO^roducing CHO cells to be expanded easily to large volumes. CHO cells, grown in sus- 
r "^"'^^ ^' ^" "^'^'"9 of 1 .5 X 107 viable cells per 850 cm2 roller bottle 

in 200 ml of media. The cells are allowed to grow to confluency as an adherent cell line over a three-day perkxJ The 

11" J.°'J ^^^^ °' ^"^^ ^ ^'"^^ suspenskjn. At the end of the three^ay 

growth period, the serum containing media is removed and replaced with 100 ml of serum-free media; 50-50 mixture 
of high g ucose DMEM and Ham's F12 supplemented with 0.05 mM non-essential amino acids and L-glutamine The 
roller bottles areretumed to the roller bottle incubator for a period of 1-3 hours and the media again is removed anS 
replaced wrth 1 00 ml of fresh serum-free media. The 1 -3 hour incubatton of the serum-free medTa reduc^s me cd^ 
cent ration of contaminating serum proteins. The roller bottles are retumed to the incubator for seven days during which 

S^iTh " '^"""-'^^^ '^"""'^ ""^"^ °' ^^^'^ Productfon phase, the condi- 

^one^ media is removed and replaced with fresh serum-free medium for a second production cycle. As an example 
of the practice of the production system, a representative seven^Jay. serum-free media sample contained human 
erirthropoietin at 3892 ± 409 Ulml as judged by the RIA. Based on an estimated cell density of 0 9 to 1 8 X 105 cX 
an . each 850 cm2 roller bottle contained from 0.75 to 1 .5 x 108 cells and thus the rate of production of EPO in the 
7-day. 100 ml culture was 750 to 1470 H/106 cells/48 hours. 

Culture fluids from cell strain CHO pDSVL-MkEPO carried in 10 nM MTX were subjected to RIA in vitro and in 
!r o f n nl?l f "^^ "^^^^^ -"edia sample container 41 .2 ± 1 .4 U/ml of MkEPO as measured by the 
„iL ^ ; ° ^ '"eaeured by the in vilm biological activity assay and 42.5 ± 5 U/ ml as measured by the in 

wobiological activity assay. Amino acid sequencing of polypeptide products revealed the presence of EPO products 
a pnnciple species having 3 reskJues of the -leader" sequence adjacent the putative amino terminal alanine Whether 
this IS the result of incorrect membrane processing of the polypeptide in CHO cells or reflects a difference in structure 
Of the ammo terminus of monkey EPO vis-a-vis human EPO, is presently unknown' 

Culture fluids from cell strain CHO pDSVL-gHuEPO were subjected to the three assays. A 5.5 day sample con- 

Sh^r «?Tn?f/"?K"'^" -^^ ^ ^ °' U/"" "y ™A assay. 15.8 ± 4.6 U/ ml by /n viui assay 
ana 16.8 ± 3.0 U/ml by m vivo assay. ' 

Culture fluid from CHO pDSVL-gHuEPO cells prepared amplified by stepwise 1 00 nM MTX were subjected to the 
three assays. A 3.0 day sample contained recombinant human EPO at a level of 3089 ± 1 29 U/ml by RIA 2589 ±715 
U/ml by /n vrfroassay. and 2040 ± 160 U/ml by /n Wvoassay. Amino acid sequencing of this product reveals an amino 
terminal corresponding to that designated in Table VI. 

Cell conditioned media from CHO cells Iransfected with plasmid pDSVL-MkE in 10 nM MTX were pooled, and 
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the MTX dialyzed out over several days, resulting in media with an EPO activity of 221 ± 5.1 U/ ml (EPO-CCM) To 
determine the ,,7 wvo effect of the EPO-CCM upon hematocrtt levels in normal Balb/C mice, the folLingrpeZent 
?S S'vS us:; CHO cells (CCM) and EPC^CM were adiuSed wTh 

PBS. CCM was used for the control group (3 mice) and two dose levels of EPO-CCM - 4 units per iniectbn and 44 
sLL ml? °"- ~ experimental groups (2 mice/group). Over the couL o 5 weeL the 
the lTro, !f '"•'^.P^"»°"^-"y- 3 tinges per week. After the eighth injection, average hematocrit values for 

the control g oup were determined to be 50.4%; tor the 4U group. 55.1%; and. forthe 44U group 67 9% 

using Src ;?iToZtr«rT T '^^"'^ '^^^^^^ substantany purified form from culture med« 
using MKLQ^ (C4) employing an ethanol gradient, preferably at pH7 

CO^rrjT HO^"T' '° ^''^^'^•^"'^ recombinant glycoprotein products from conditioned medium of 

COS-1 and OHO cell expression of the human EPO gene in comparison to human urinary EPO isolates usino both 
western bto. analysis and SDS-PAGE. These studies indicated that the CHO-produced EPo\iSl^r?omr 
What higher molecubr weight than the COS-1 expression product which, in turn, was slightly larger than ^ e cXl 
source human urinary extract. All products were somewhat heterogeneous. NeuraminkJase enzj^S,e treatment to r^ 
Z lS"n^lT"'? "^^^ recoa^tmsnx products of approximate^ equal moLula'weTght\S 

uln. ? """etheless larger than the resulting asialo human urinary extract. Endoglyco^idase F enzyme Ic S2T) 
treatment of the recombinant CHO product and the urinary extract product to totalV remove carbo^Se from boiJ 
""Gt^r^^r^'S^ homogeneous products having essential^ identical molecL weigh? cSelJi^ 
strJS V^'"*^ P'*'^"* '^^'^ ^''"^ comprehensive of products having a primary 

^InT^S^curgTXSr""""'^^^^^ 

Example 11 

«n«35nnT^r' ^"^^^ '^^^^ "«""'acture by assembly of nucleotide bases of two structural qenes 
! i^fn I , T ''"T" ^"^"^ °' ^ incorporating, respectively 'preferred^^s Ici IxpTs! 
1^ R T ?f IT (^■'^'^'''^^>'^^«^ Also described is the constmction of genes encodhg^s^f huZ 

LTh r. ^' ""^'^ ''^^'^"^ ^^^""^ component oligonucleotides into multiple duplexes 

wfuch.,n turn were assembled into three discrete sections. These 

irsiTcrrvrr""'^^^"^'"'^ 

u.J^SI^ ^^^'^ '""^'^'^ ^^^'9" ^ ^«««'"'"y °' ^ manufactured gene encoding a human EPO 

Le7?h« n ' ""'r' °^ Presequence bu, including an initial methionine residue ^Sto^Tj^^ 

to ^ tSe -ECETO- g^r' ""'"^"^^ '^-^^ ^ tSe ref^^r^ 
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TA8LE VIII 
ECEPO SECTION 1 OLIGONUCLEOTIDES 



1. 


AATTCTAGAAACCATGAGGGTAATAAAATA 


2. 


CCATTATTTTATTACCCTCATGGTTTCTAG 


3. 


ATGGCTCCGCCCCGTCTGATCTGCGAC 


4. 


CTCGAGTCGCAGATCAGACCCGGCGGAG 


5. 


TCGAGAGTTCTGGAACGTTACCTGCTG 


6. 


CTTCCAGCAGGTAACGTTCCAGAACT 


7. 


GAAGCTAAAGAAGCTGAAAACATC 


8. 


GTGGTGATGJTTTCAGCTTCTTTAG 


9. 


ACCACTGGTTGTGCTGAACACTGTTC 


10. 


CAAAGAACAGTGTTCAGCACAACCA 


11. 


TT TGAACGAA AACATTACGGTACCG 


12. 


GATCCGGTACCGTAATGTTTTCGTT 



TABLE IX 
ECEPO SECTION 1 

Xba l 

EcoRI 1 _ 

'"''^llrc rirrrrlrii ^Hle*^*'*'** Ta|atGGCTCC GCCGCGTCTG 
GAFC TTTGGTACTC CCAFTATTTT ATT55qGAGG CGGCGCAGAC 

- *. 

ATCTGCGAc tr CGAGA GTtcT GGAACGTTAC CTGCTgIgAAG CTAAArAACr 
TAGACGCTGr5UTc|rCAAGA CCTTGCAATC GACGASfrq M?ff??KG 

f g?Ig Ifiifihr?!! ctgttcItttg aacgaS^aca 

ACTTTTGTAG TGGTCpCCAA CACGACTTQT GACAAGASScI TTGCTTTTGT 
o 10 ' 



Kon I BaroH I 
TTACGGTACC G 
AATGCCATGG CCTAG 
12 



32 



EP 0148 605 B2 



ECEPO 


TABLE X 
SECTION 2 OLIGONUCLEOTIDES 


1. 


AATTCGGTflCCAGACACCAAGGT 


2. 


GTTAACCTTGGTGTCTGGTACCG 


3. 


TAACTTCTACGCTTGGAAACGTAT 


4. 


TTCCATACGTTTCCAAGCGTAGAA 


5. 


GGAAGTTGGTCAACAAGCAGTTGAAGT 


6. 


CCAAACTTCAACTCCTTGTTGACCAAC 


7. 


TTGGCAGGGTCTGGCACTGCTGAGCG 


6. 


GCCTCGCTCAGCAGTGCCAGACCCTG 


9. . 


AGGCTGTACTGCGTGGCCAGGCA 


10. 


GCAGTGCCTGGCCACGCAGTACA 


11. 


CTGCTGGTAAACTCCTCTCAGCCGT 


12. 


TTCCCACCGCTGAGAGGAGTTTACCA 


13. 


GGGAACCGCTGCAGCTGCATGTTCAC 


14. 


GCTTTGTCAACATGCAGCTGCAGCGG 


15. 


AAACCAGTATCTGGCCTGAGATCTG 


16. 


GATCCAGATCTCAGGCCAGATACT 
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TABLE XI 
ECEPO SECTIOK 2 
EcoRI Kpn l i 3 

A ATTCGGTACC AGACACCAAG GT ITAACT TCT ACGCTTGGAA ACGTATbcAA 
GCCATGG TCTGTGGTTC CAjTfT^AGA TCCGAACCTT TGCATAKTt| 

GTrCGTCAAC AAGCAGTTGA AGThrGGCAG GGTCTGGCAC TGCTGAGcA 
CAACCAGTTfi TTCGTCAACT TCAA^TC CCAGACCGTG ACGACTCGCT 

§. . a ■ 

GGCTGTACTG CGTGGCCAGG CA btGCT GCT AAACTCCTCT CAGCCGThcG 
CCGfeCATGAC GCACCGGTCC GTGACiancCA TTTGAGCAGA CTCGGCACUC 
10 I 12 

AACCGCTgiA 6CTGCATGTT GAc|ftAAGCAG TATCtIcCCT GAgIt^TG 
nlGCCGACGT CCACGTACAA GTGTTTCGirc ATAGACCGGA CTCTAGACCTAC 
M ' 16 
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table xii 
ecepo section 3 
gatccagatctctgactactctgc 
acgcagcagagtagtcagacatctg 
tgcgtgctctgggtgcacagaaagagg" 

GATAGCCTCTTTCTGTGCACCCAGAGC 

CTATCTCTCCGCCGGATGCTGCATCT 

CAGCAGATGCAGCATCCGGCGGAGA 

GCTGCACCGCTGCGTACCATCACTG 

ATCAGCAGTGATGGTACGCAGCGGTG 

CTGATACCTTCCGCAAACTGTTTCG 

ATACACGAAACAGTTTGCGGAAGGT 

TGTATACTCTAACTTCCTGCGTGGTA 

CAGTTTACCACGCAGGAAGTTAGAGT 

AACTGAAACTGTATACTGGCGAAGC 

ggcatgcttcgccagtatacagttt 
atgccgtactggtgaccgctaatag 
tcgactattaccggtcaccagtac 
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TABLE XIII 
ECEPO SECTION 3 



BamHl aqlll 
GA TCCAGATCTCTG 
GTCFAGAGAC 



i L 2 .5 

rrlrnlrlnr Inn^U^l^l GGGTGCACAG AAAGAGG bTA TC TCTCCGCC 
TGATGAGACG S^GeAjlGAGA CCCACGTGTC TTTCTCCGAT^GAGCCGG 

GGATGCTGCA TClbCTGCAC CGCTGCGTAC CATCACTcfcT GATACCTTcC 
CCTACGACGT AGAPgAqSTG GCGACGCATG GTAGTGACGTTT^TGGAAGG 



GCAAACTGTT 
CGTTTGACAA 
10 



TCG tTGTATA C 
AGCACATA|r 



n 

TCTAACTTCC 



- TGCGTGGTA h 
G AGATTGAAGG ACGCACCATrnr^AqTTTGAC 



13 

ACTGAAACTG 



11 



15 

Irlrnln^^^ ^^GCfaTGCCG TACTGGTGAC CGCTAATAG 
ATATGACCGC TTCGTA&raC ATGACCACTG GCGATTATC 
11 16 



Sai l 
AGCT 
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TABLE XIV 
ECEPO GENE 



-1 1 
MetAla 



ff?«fic°fc° c«?;j?f?; i??L°^"" 

"uuiMLic CCflTTATTTT ATTACCGAGG CGGCCCAGAC 
ATCTGCGACT CGAGAGTTCT GGAACGTTAC rTrrTrr«<„. 

r«c«cccTc. ocrcTcc. ccJSJJfc cJSc"?? Slfffc^fc" 

fSii ?K ?;sc=ff??s; 

IS c^JSj ™j ,™ 

-™ ™c=. 

SJc= §s??e= SfffiJ 
f?Kf ssj ?f§;s ??Kc 
ffl; ?cw? ?ffs 
ffls 

GCAAACTGTT TCGTGTATAC TCTAACTTCC TCCrTrrTan noT^,» 
CGTTTGACAA AGCACATATG AGAnGlJcG iSccJ?? f^Jc'S^fcJ? 

TATiiCTGGCG AAGCATGCCG TACTGGTGflr rrrroflTc^^^ 
ATATGACCGC TTCGTACGGC ar llllTjl lllljlVj^, CCT 

site. Section 1 could readily be arriDlified u^inn iho miq ^k/ V . ^ '® ® recognition 
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fragment was thereafter readily isolated. 

oi laoies X and XII, respectively. Each section was amplified In the Ml 3 vector employed for seauence verification 

l^J^- T ^ ^ '^'^ ^ Sif^ilariy. ECEPO Section 3 was prepared with 

fia/nHI and Sail sticky ends and could be isolated from phage RF DNA as a BgM/Sali fragment The thre^^ct^s 
thus prepared can readily be assembled Into a continuous DNA sequence (S)le XIV) encSnn thV^r^^^^n 

.h^ S:e:;;e^r l^^V^^^^^^^ ^^'^ ^"'^•^""^■^ ^^^-^ ^-^^ ^^'^ sequence of 

of thtlc1?Sa3^t!r''°' ^^^^ 

SmTfif '^'"P^^^^"^^ Plasmid PCFM536 _ a derivative of plasmid pCFM414 (A TC C 

40076) _ as described in co-pending U.S. Patent Application Serial No. 636,727 filed Auaust 6 1 984 bv ChJiS p 
Morris. More specificalV. pCFM536 was digested with Xtel and HMU; theSge fmgmem J^^^^^^^^ 

ma,b. Me,M o, IK. o,,,, ™p,.s.o. ge.e (such =s pM*a « £„/,sl«n K,24«w 



A. [Asn2, des-Pro2 through lle^JhEPO 



.he ^st base o, the codon encoding AspB through the sToS^re Of 

was manufactured having the following sequence: cooon. a Afial/x/wl linker sequence 



Xbal +1 



2 


7 


S 


Asn 


Cys 


Asp 


AAT 


TGC 


GAC 


TTA 


ACG 


CTG 



Xhol 



B. [HIsTJhEPO 

the f^rgfe^uTr^e;'"'"^ ^'^ ^ ^'^^'^ -s manufactured having 



Xbal +1 



4 


5 


6 


7 


Arg 


Leu 


He 


His 


CGT 


CTG 


ATC 


CAT 


GCA 


GAC 


TAG 


GTA 



Xhol 
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oligonucleotides 1 of Section 2 in each gene. 





TABLE XV 


SCEPO 


SECTION 1 OLIGONUCLFnTTHP*: 


1. 


AATTCAAGCTTCGATAAAAGAGCT 


2, 


GTCGAGCrCTTTTATCCAAGCTTG 


3. 


CCACCAAGATTGATCTGTGACTC 


4. 


TCTCGAGTCACAGATCAATCTTG 


5. 


GAGAGTTTT6GAAAGATACTTCTTG 


6. 


CTTCCAACAAGTATCTTTCCAAAAC 


7. 


GAAGCTAAAGAAGCTGAAAACATC 


8. 


GTGGTGATGTTTTCAGCTTCTTTAG 


9. 


ACCACTGGTTGTGCTGAACACtGTTC 


10. 


CAAAGAACAGTGTTCAGCACAACCA 


11. 


TTTGAACCAAAACATTACGGTACCG 


12. 


GATCCGGTACCGTAATCTTTTCGTT 



TABLE XVI 
SCEPO SECTIOM 1 

EC OR I riind lll l 
AATTCA AGCTTGGATA 
GT TCGAACCTAT 

2 

AAAGAGCTtc_ACCAAGlTTG ATCTGTGACT cbACarTTTT 
TTTCTCGAgTT^TTCTAAC TAGAM??m G?f§TfcJwJ 

c«i?s iiiii^ iird^^ i?f??;g?ra ^iii 

cJcGlc?T§f lim^^ AACGAS^ACA TTAC^fr^CC G^ ' 

cacgacttgt gacaagaaaC^ ttgcttttgt aatgccatgg cctag 

12 
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TABLE XVII 




SECTION 2 OLIGONUCLEOTIDES 


1. 


AATTCGGTACCAGACACCAAGGT 


2. 


GTTAACCTTGGTGTCTGGTACCG 


3. 


TAACTTCTACGCTTGGAAACGTAT 


4. 


TTCCATACGTTTCCAAGCGTAGAA 


5. 


GGAAGTTGCrCAACAAGCAGTTGAAGT 


6. 


CCAAACTTCAACTGCTTGTTGACCAAC 


7. 


TTGGCAAGGTTTGGCCTTGTTATCTG 


8. 


GCTTCAGATAACAAGGCCAAACCTTG 


9. 


AAGCTGTTTTGAGAGGTCAAGCCT 


10. 


AACAAGGCTTGACCTCTCAAAACA 


11. 


TGTTGGTTAACTCTTCTCAACCATGGG 


12. 


TGGTTCCCATGGTTGAGAAGAGTTAACC 


13. 


AACCATT6CAATTGCACGTCGAT 


14. 


CTTTATCGACGTGCAATTGCAA 


15. 


AAAGCCGTCTCTGGTTTGAGATCTG 


16. 


GATCCAGATCTCAAACCAGAGACGG 
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TABLE XVIII 
SCEPO SECTION 2 



Kpn l 
EcoRI 1 

A ATTCGGTACC AGACACCAAG 
GCCATGG TCTGTGGTTC 
2 



,2 5 
GTfrAACTTCT ACGCTTGGAA ACGTAThcAA GTTGGTCAAC AAGCTrrTra 
CAfeTT^AGA TGCGAACCTT TGCATA^ CAaEcAGtVg fJwI^Iicf 



6 



TCCpCAAAAC TCTCCAGTTC 
10 



CCTfrGTTGGT TAACTCTTCT CAACCATGGG McCATTGCF^ATTrrflrrTr 

ggaIo^ca attgagaaga gttggtaccc fTEE^IcG? flJccf^MG 



. 15 BolII BamHI 

GAT hAAGC CG TCTCTGGTTT GAGATCTG 

CTATTTCpGC AGAGACCAAA CTCTAGACCTA G 
X 6 
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TABLE XIX 


SC£PO 


SECTION 3 OLIGONUCLEOTIDES 


1. 


GATCCAGATCTTTGACTACTTTGTT 


2. 


TCTCAACAAAGTAGTCAAACATCTG 


3. 


GAGAGCTTTGGGTGCTCAAAAGGAAG 


4. 


ATCCCTTCCTTTTGAGCACCCAAAGC 


5. 


CCATTTCCCCACCAGACGCTGCTT 


6. 


GCAGAAGCAGCGTCTGGTGGGGAA 


7. 


CTGCCGCTCCATTGAGAACCATC 


8. 


CAGTGATGGTTCTCAATGGACCG 


9, 


ACTGCTCATACCTTCAGAAACTT 


10. 


GAATAACTTTCTCAAGGTATCAG 


11- 


ATTCAGAGTTTACTCCAACTTCT 


12. 


CTCAAGAAGTTGGAGTAAACTCT 


13. 


TGAGAGGTAAATTGAAGTTGTACAC 


14. 


ACCGGTGTACAACTTCAATTTACCT 


15. 


CGGTGAAGCCTGTAGAACTGGT 


16. 


CTGTCACCAGTTCTACAGGCTTC 


17. 


GACAGATAAGCCCCACTGATAA 


18. 


GTTGTTATCAGTCGGGCTTAT 


19. 


CAACAGTGTAGATGTAACAAAG 


20. 


TCCACTTTGTTACATCTACACT 
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TABLE XX 
SCEPO SECTION 3 

BamHI BolII I 

GATC CAGATCTTT^ ACTACTTTGT TbAGAGCTTT 
GTCTAGAAAC^TGATGAAACA ACTU?|:CAAA 

rrrl^rl^T? ^^^SSSStCAJirTCCcf ACC AGACGCTGCT TCTGCCGCTC 
CCCACGAGTT TTCCTTCGSriJiAGGGGTGG TCTGCGACGA AGACG-GCGAG 

- i 

1^9 11 
CATTGAGAAC CATC ^CTGC T GATACCTTCA GAAAGTThTT CAGAGTTTAC 
GTAACTCTTG GTAGtSS^A CTATGGAAGT CTTTCAAfsOfrc?MAJ?G 
S- 10 12 

12 15 
TCCAACTTCT [[GAGAGGTAA ATTGAAGTTG TACACfccCTG AAGCCTGTAG 
AGGTTGAAGA feeT^TCCATT TAACTTCAAC ATCTclffaC TTCGGACA?? 



14 

17 19 



T^rlrrlfe^-rftJISSS'^^^ GACTGATAA^_AACAGTGTAG 
TTGACCAbTG TCpTTCGGG CTGACTATirTTaTCACATC 

18 ~ 



Sail 

ATGTAACAAA G 
TACATTGTTT CAGCT 
ZO 
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TABLE XXI 
SCEPO GENE 

-1 +1 
Hlnd lll ArgAla 

'^^^^ISoS^^ AAAGAGCTCC ACCAAGATTG ATCTGTGACT CGAGAGTTTT 
ACCTAT TTTCTCGAGG TGGTTCTAAC TAGACACTGA GCTCTCAAAA 

^S**^"^'^^ TTGTTGGAAG CTAAAGAAGC TGAAAACATC ACCACTGGTT 
CCTTTCTATG AACAACCTTC GATTTCTTCG ACTTTTGTAG TGGTGACCAA 

SI^^I^T^E* CTGTTCTTTG AACGAAAACA TTACGGTACC AGACACCAAG 
CACGACTTGT CACAAGAAAC TTCCTTTTGT AATGCCATGG TCTGTGCTTC 

rll???!!?! JS^II^SS^ ACGTATGGAA GTTGGTCAAC AAGCTGTTGA 
CAATTGAAGA TCCGAACCTT TGCATACCTT CAACCAGTTG TTCGACAACT 

t^llU^^tt TGTTATCTGA AGCTGTTTTG AGAGCTCAAG 

TCAAACCGTT CCAAACCGGA ACAATAGACT TCGACAAAAC TCTCCAGTTC 

CGTTGTTGGT TAACTCTTCT CAACCATGGG AACCATTGCA ATTGCACGTC 
GGAACAACCA ATTGAGAAGA GTTGGTACCC TTGGTAACGT TAACGTGCAG 

nrl^S^SSHS lEI^^^^^^^ GAGATCTTTG ACTACTTTGT TGAGAGCTTT 
CTATTTCGGC AGAGACCAAA CTCTAGAAAC TGATGAAACA ACTCTCGAAA 

GGGTGCTCAA AAGGAAGCCA TTTCCCCACC AGACGCTGCT TCTGCCGCTC 
CCCACGAGTT TTCCTTCGGT AAAGGGGTGG TCTGCGACGA AGACGGCGAG 

nSIIrx^r^S ^SIE2H^°'^^ CATACCTTCA GAAAGTTATT CAGAGTTTAC 
GTAACTCTTG GTAGTGACCA CTATGGAAGT CTTTCAATAA GTCTCAAATG 

TCCAACTTCT TGAGAGGTAA ATTGAAGTTG TACACCGGTG AAGCCTGTAG 
AGGTTGAAGA ACTCTCCATT TAACTTCAAC ATGTGGCCAC TTC6GACATC 

AACTGGTGAC AGATAAGCCC GACTGATAAC AACAGTGTAG 
TTGACCACTG TCTATTCGGG CTGACTATTG TTGTCACATC 

Sail 

ATGTAACAAA C 
TACATTGTTT CAGCT 



The assembled SCEPO sections were sequenced in Ml 3 and Sections 1 , 2 and 3 were isolatabte from the phage 
as HinaWKpri/BGM, and Bg^VSaH fragments. 

The presently preferred expression system for SCEPO gene products is a secretion system based on S.cemvisiae 
a-tactor secretion, as described in co-pending U.S. Patent Application Serial No. 487.753. filed April 22. 1983. by Grant 
A. Bitter, published October 31. 1984 as European Patent Applicalion 0 123 294. Briefly put. the system involves 
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te^er or sSZ?: «^°9!~"» 9«"« ^ ^^^ressed. As a resu«. the gene product translated includes a 
-^TnnlTtK ^^'^J^"'' -processed off- by an endogenous yeast enzyme in the course of secretion of the 
3o nl2 to rSn?- '=°"f --^^ °' tf^e -'actor translation init«,Kx, (ATG) codon.Lre 

Zin! r if ^ ^ ^' "'^ P^""*"" °' S^EPO 9«"«- As may be noted from Table xkl the 

LTl f gene expression »,voVed a four-part ligation including the above-noted SCEPO section fragments and 

Sfo^mii^'^^^""^ ^''T^ ^'9°^^*°" With ferpHl'and ligated in o ^^Li di- 

gesiea plasmid pYE to form expression plasmid pYE/SCEPO. 

Example 12 

The present example relates to expression of recombinant products of the manufactured ECEPO and SCEPO 
genes within the expression systems of Example 11 '^ciurea totnu and SCEPO 

<:»\'"»te™»M.l,sM,bfol,.ow«hF.en*P,e8, (10,000 p=l) and mai«n»^^ 40fl. 
M,g«^ The M., reining l,om 24.0CK) X9 cenlfflugallon «bl£<i „m 9^S«rHS^,SSM;:S^!; 
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As previously indicated, recombinant-produced and synthetic products of the invention share, to varying degrees 

Inh 'r,T ^ ■ ^1^"^ °' '^'^'^ ^'""'^^ consequently are projected to have ut^ty as 

subst tmes for EPO isolates in culture media enripldyed for growth of erythropoietic cells in culture. Similarly, to the 
e^entthat polypeptide productsoftheinventionsharethe/nmoactivityof^^^^ 

suiteble for use in erythropoietin therapy procedures practiced on mammals, including humans, to develop any or aH 

nf h!S^i K ^t^^^^^r T ^^^^ ^ "^""^ ""^ ««^*^)- eOrthrocyte mass changes, stimulation 

Tl^ l ^rPTZ-^T '"P'^>^ '"'^'^ ^'^'"P'^ i"^'e««*"g hematocrit levels 

f "T* f'"^ ^'^""^ ^"'9'^' P^"^"'^' patients including dialysis patients 

a^S «tTrTK'°'''°*^'°^^ 

anemias, and the like. The minimization of the needfortransfusion therapy through use of EPOtherapy can be expected 

o enhf'-J' '° °' '""'"""^ substances, and the like, and are thus likeTTo 

provide enhanced overall effeclr^eness in therapeutic processes vis-a-vis naturally derived products Erythropoiel n 

T^'^ '"'"""^ ^''P^^'^ '° the enhancement of oxygen carr^i^g 

cS eKr «"«'""'e"n9 hypoxic environmental conditions and possib^ in providing benefic^l cardicS^as 

A preferred method for administration of polypeptide products of the invention is by parenteral (e g IV IM SC 
Tn iTlZr^ i "'Tk!"'!"^ administered would ordinarily include therapeutically effective amounts oi produci 
Lr^S^ 1^ ^"""'"^^ pharmacokinetic studies indicate a long- 

^hln, 71 . ^'"^^^^'^ rather than IV. Effective dosages are expected to 

n ,~7u 

^l^li^I . -TOOOU) ng/kg body weight of the active material. Standard diluents such as human serum 
m^^lT^T^'^^lZ^^^ compositons of the invention, as are standard carriers such as saline 

arvthri^i . ' "'^ «»rnpositions of the inventbn Include compounds independently noted for 

Kn?«l^ 7 ^^stoete^onee. Progenitor cell stimulators, insulin^ike growth factor, prostag- 

landir^s. serotonin, cycte AMP. prolactin and triiodothyronine, as well as agents generally employed in treaim^r^fof 

S»o^^f?flS; ^' '^"^"^y'"'- ^^(2>- 437-444 (1984): Paviovic-Kantera. atal.. ExpL Hematol 

a2,;l mvroid h^^^^^ '° '"T' ""^'^ ^ryt^'ropoietin or asialo-EPO. such as the adrenergic 

^^(ci.S^^l^TT.^?^''' ^°"<=^P'^ Erythropoiesis- John Wiley and 

( 982) Shahid^ New Eng. J. Med.. 289. 72-80 (1973); Fisher, at al.,. Steroids. 30(6). 833-845 (1977)- Urabe at 
a .. J. Exp Med.. U9. 1314-1325 (1979); and Billat, et al., Expt. Hemato,.. 10(1). 13^-140 (1982)1 2 we asVe 

( 983)] and eryth^tropins" [as described by Congote. at al. in Abstract 364 Proceedings 7th Internal c^al Congress 

Lr.^'T^S^'lofTfSS; ?V^^^^ (^««^>^ -erythrogenin^-ITs described in'Rothmtritll!? 
hvrSvir!^;^^' <^^^2)]. Preliminary screenings designed to measure erythropoietic responses of ex- 

of the present invention have generated equivocal results. 'y«"opoieiin 

form?i?rSL"vf ^ °' P°'yP^P"''^« °' are similarly extensive and include use in labelled and unlabelled 

IXT °' u^munoassay techniques including RIA's, ELISA's and the like, as well as a variety of Wrro and^n 

15&-156 (1984). Krystal. Expt Hematol., 11 7). 649-660 (1983); Saito, et al.. Jap. J Med 23(1) 15-21 n984V 
%LTZ t references pertaSglJLays refiS 

TZ^l^y^ ^ l^^ °' '^.^ ^^"'^^"^ P^P«^^^ comprising sequences of residuJot EPO firs° 

Z anlbTestLTTnT^ ""'^^'^ ^^"^'^''"S polyclonal antibodies and "banks- of mo^S 

^Tl^iirIf.?Jl ^P^'^*"!"' '*'"«""9 continuous and discontinuous epitopes of EPO. As one example, preliminaor 
anahrsis <rf the a-rnno ac*^ n^^ ^ ^ ^^^^^ hydropathicity according to Hopp. at al , PNA S 

25 (1 978) revealed that synthetic peptides dupfcative of continuous sequences of residues spanning positions 41 % 

Z^^^f polyclonal antibodies immunoreactive with both the synthetic peptide and the entir? pr^n sS, " 
no Smed h "^^'"^ ^ ^"'""^ °' ^''^ ^'^'^ EPCrelated pr<JlucS but S 
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Illustratively, the foltowing three synthetic peptides were prepared: 

(1) hEPO 41-57. V-P-D-T-K-V-N-F-Y-A-W-K-R-M-E-V-G- 

(2) hEPO 116-128. K-E-A-l-S-P-P-D-A-A-S-A-A- 

(3) hEPO 144-166. V-Y-S-N-F-L-R-G-K-L-K-L-Y-T-G-E-A-C-R-T-G-O-R. 

' J ^'Tf • "° Wreclable response to hEPO 116-128. and a strong posttive response tohEPO 

144-146^ as nrieasured by capactty of rabbit serum antibodies to Immunopreclpitate ^^^l-iabeUed human urinary EPO 
SSLJ,""'"''' ^"'"^ °" ''"'^^ "° ^^Snificant activity either aZ or S 

ess21'^Z^S'l^f '^"'T"?' '^"'"^^ °' mammalian EPO provided by the illustrative examples 

Of S al« H confom«tion of mature EPO. H will be underetood that the specific sequence 

S r„onw,fhfr ? °' T'f ^ ^'^ " ^"'"^ ^ ^"'^ ^ °» «P«<='es EPO iS Table 

.h^ ^ri^r; ! "'^ P^'yP^P'''^^^ P'°^Wed by the invention. Comprehended by the present inventbn are 

retried to h^^^^^^^^^ ' interferon rndicates are likeV to exist. Compare, e.g.. the human immune interferon^e^ies 
Zrt^H o h '^'"'"^ No, 140 in EPO published application 0 077 670 and the species 

reponed to have glutam.ne at position No. 140 in Gray, et a!.. Nature. 295. pp. 503-508 (1982) Both species are 
Characterized as constituting "mature' human y interferon sequences.) Allelic forms of mature EPO pXept^rs mav 
TT."' ^ °' ^"^'^^ ^ ^' °' Of sequence anS termToI 

? ;e^;«ct r- Tr"' °' ^^^"^ °' ^■^''^^ '^^"Suent potential varSons 

!<^^u5f/S glycosylafon. As noted previous^, one putative allelic form of human species ipo is belieSdT 

IdLIL T r!f ^"P""'"'^'^- "««"'«''y-occurrin9 allelic forms JEPO-encodinrD^A and 

differing codons for designation of the same polypeptides as specified r /r k p'^y 

In addition to naturaHy-occurring allelic fomfis of mature EPO. the present invention also embraces other "EPO 

mTcSi 1 T'^? ^' one -"^y readify design and manufacture J^nS J^n^ tor 

EpiTn TlZoHh T ^''T"""" P''"'^ confon^ations which differ from that herein specified iT2Z 
anddel2ir/iHr T7°'^^^ 

wet^rsl ei rl^d m t '^'^ ^'^ -"^^ -^^''^ accomplished by 

ZZ I^lTr. ^ ^ °' *® biological properties of EPO but may differ in others As examoles 

,hT t?, sT.^""'' °' those Which are foreshortened by e g . deiSoislAsrfJ deT-pS 

Tant ^ exTn d^^^^^^^^ ''^^"''"'"^ ■A27-55hEPO-. the Jer having Se ^iSes .J^ed S 

S™S K ^'^■'^^""'"3 ^'^^^ °' "^'"^ "^"^ "^^^ 'o one or more a potential sites to 

Sas dl/ H T :!fK" yeasti,roduced products): or which have one or 2e cytte" 
H east itt°e^Tn'^^^^^ histidine or serine residues (such as the analog lHis7]hEPO) and are pote'n.iall 
nhol?oi • , ^ u '"^ ^y^'^'"^: °^ '^'^'^ f'^ve one or more tyrosine residues replaced bv 

torPO eZro ? 

uousTrr^TnT^^^s ' ''^^P'^''^"*^ are polypeptide fragments duplicating only a part of the contin 

TbpoZ fjrrr:' T°T'^ contormattons wrthin mature EPO. which fragments may possess one aS^ 
Ic ? l^^;-' °' "'""'"^^ ^""^ "'^ '^^'^ sfythropoietic activity). Especially significantin this reqard a e 

Table VI. i.e.. fragments' of the total continuous EPO sequence which are delineated by intron seouences and whirh 

oTrJorre^'pS:!:^^^^^^ 

o S ut!mi?n oth™^.,c K co^" " "^""^ "'"""''^^ °' Wfei'and. at al.. supra) 

ulet nrtreatm^n 1 ^ ^'^^ ^"^^^ °' antagonism. Antagonists of erythropoietin may be quite 

inrSrand °' ^'^ « 9 • '^-"'^on, Help. PracnZ. 16(12). 

a/ (1S83). and Hellman, atal., C/;a Lab. Haemal. 5. 335—342 (1983)] 

According to another aspect of the present invention, the cloned DNA sequences described herein which encode 
human and monkey EPO polypeptides are conspicuous^ valuable for the info.TT.atioo whichTey pXidTS^eSS^Q 
the ammo acid sequence of mammalian erythropoietin whfch has heretofore been unavailabte Se SSdes 
analytical processingof^latesof naturally-occurring products. The DNA s^^^^ 

as products useful in effecting the large scale mfcrobial synthesis of erythropoietin by a ^^e^^fJSnSLn 
n-ques. Put another way. DNA sequences provided by the invention are useful in genLting nrl^dTeS v^^^^^^^^ 
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circular plasmid DNA vectors, new and useful transformed and transf ected microbial procaryotic and eucaryotic host 
colls (Including bacterial and yeast cells and mammalian cells grown in culture), and new and useful methods for 
cultured growth of such microbial host cells capable of expression of EPO and EPO products. DNA sequences of the 
invention are also conspicuously suitable materials for use as labelled probes in isolating EPO and related protein 
encoding cDNA and genomic DNA sequences of mamnfialian species other than human and monkey species herein 
specifically illustrated. The extent to which DNA sequences of the invention will have use in various alternative methods 
of protein synthesis (e.g.. in insect cells) or in genetic therapy in humans and other mammals cannot yet be calculated. 
DNA sequences of the invention are expected to be useful in developing transgenic mammalian species which may 
serve as eucaryotic "hosts" for production of erythropoietin and erythropoietin products in quantity See generally 
Palmiter. et al., Science, 222(4625), 809—814 (1983). 

Viewed in this light, therefore, the specific disclosures of the illustrative examples are clearly not intended to be 
limiting upon the scope of the present invention and numerous modifications and variations are expected to occur to 
those skilled in the art. As one example, while DNA sequences provided by the illustrative examples include monkey 
cDNA and genomic DNA sequences, because this application provides amino acid sequence information essential to 
manufacture of DNA sequence, the inventton also comprehends such manufactured DNA sequences as may be con- 
structed based on knowledge of EPO amino acid sequences. These may code for EPO (as in Example 12) as well as 
for EPO fragments and EPO polypeptide analogs (i.e. , -EPO Products") which may share one or more biotogkal prop- 
erties of naturally-occurring EPO but not share others (or possess others to different degrees). 

DNA sequences provWed by the present invention are thus seen to comprehend all DNA sequences suitable for 
use in secunng expresston in a procaryotic or eucaryotic host cell of a polypeptide product having at least a part of the 
primary structural conformation and one or more of the biological properties of erythropoietin, and selected from among 
(a) the DNA sequences set out in Tables V and VI; (b) DNA sequences which hybridize to the DNA secuences defined 
in (a) or fragments thereof; and (c) DNA sequences which, but for the degeneracy of the genetic code, wouki hybridize 
to the DNA sequences defined in (a) and (b). It is noteworthy in this regard. for example, that existing allelic monkey 
and human EPO gene sequences and other mammalian species gene sequences are expected to hybridize to the 
sequences of Tables V and VI or to fragments thered. Further, but for the degeneracy of the genetic code the SCEPO 
and ECEPO genes and the manufactured or mutagenized cDNA or genomic DNA sequences encoding various EPO 
f ragments and anatogs would also hybridize to the above-menlbned DNA sequences. Such hybridizations could readily 
be carried out under the hybridization conditions described herein with respect to the initial isolatfon of the monkey 
and human EPO-encoding DNA or more stringent conditions, it desired to reduce background hybridization 

In a like manner, while the above examples illustrate the inventkxi of microbial expression of EPO products in the 
context of mammalian cell expresskxi of DNA inserted in a hybrid vector of bacterial plasmid and viral genomic origins 
a wide variety of expression systems are within the contemplation of the invention. Conspicuously comprehended are 
expression systems involving vectors of homogeneous origins applied to a variety of bacterial, yeast and mammalian 
cells in culture as well as to expression systems not involving vectors (such as calcium phosphate transfection of cells) 
In this regard, it will be understood that expression of, e.g.. monkey origin DNA in monkey host cells in culture and 
human host cells in culture, actually constitute instances of "exogenous" DNA expression inasmuch as the EPO DNA 
whose high level expression is sought woukJ not have its origins in the genome of the host. Expression systems of the 
invention further contemplate these practices resulting in cytoplasmic fomnatton of EPO products and accumulation of 
glycosylated and non-glycosylaled EPO products in host cell cytoplasm or membranes (e.g.. accumulation in bacterial 
penplasmic spaces) or in culture medium supematants as above illustrated, or in lather uncommon systems such as 
P. aeruginosa expression systems (described in Gray, at al.. Biotechnology. 2, pp. 161—165 (1984). 

Improved hybridization methodologies of the invention, while illustratively applied above to DNA/DNA hybridizatton 
screenings are equally applicable to RNA^NA and RNA/DNA screening. Mixed probe techniques as herein illustrated 
generally constitute a number of improvements in hybridization processes allowing for more rapid and reliable poly- 
nucleotide isolations. These many individual processing improvements include: improved colony transfer and mainte- 
nance procedures; use of nylon-based filters such as GeneScreen and GeneScreen Plus to allow reprobing with same 
filters and repeated use of the filter. applicatk>n of novel protease treatments [compared, e.g , to Taub at al Anal. 
Biochem.. 126, pp. 222-230 (1 982)J: use of very low indivWual concentratfons (on the order of 0.025 pi'comole) of a 
large number of mixed probes (e.g.. numbers in excess of 32); and. performing hybridizatton and post-hybridization 
steps under stnngent temperatures ctosely approaching (i.e.. within 4»C and preferably wHhIn 2°C away from) the 
lowest catoulated dissccation temperature of any of the mixed probes employed. These improvements combine to 
provide results whfch could not be expected to attend their use. This is amply illustrated by the fact that mixed probe 
procedures involving 4 times the number of probes ever before reported to have been successfully used in even cDNA 
screens on messenger RNA species of relatively low abundancy were successfully applied to the isolatton of a unique 
sequence gene m a genomic library screening of 1.500,000 phage plaques. This feat was accomplished essentially 
concurrently with the publication of the considered opinion of Anderson, at al.. supra, that mixed probe screening 
methods were "...impractical for isolation of mammalian protein genes when corresponding RNA's are unavailable 



48 



EP 0 148 605 B2 

Claims 



1. A DNA sequence for use in securing expression in a procaryotic or eucaiyotic host cell of a polypeptide product 
having a least part of the primaiy structural confirmation (s/c| of that of eiythropoietin to allow possession of the 
biological property of causing bone marrow cells to increase production of reticulocytes and red blood cells and 
to increase hemoglobin m synthesis or iron uptake, said DNA sequence selected from the gioup consisting of: 

(a) the DNA sequences set out in Tables V and VI or their complementary strands; 

il21«!f?"!!i''w"f ^^"""""^ ""^^ ^^""^^"^ conditions to the protein coding regions of the DNA se- 
quences defined m (a) or fragments thereof; and 

2. A DNA sequence according to Claim 1 encoding human erythropoietin. 

3. A cDNA sequence according to Claim 1 being a monkey species erythropoietin coding DNA sequence. 

4. A DNA sequence according to Claim 3 and including the protein coding region set forth in Table V. 

5. A genome DNA sequence according to Claim 1 or 2. 

6. A human species erythropoietin coding DNA sequence according to Claim 5. 

7. A DNA sequence according to Claim 6 and including the protein coding regk)n set forth in Table VI. 

8. A DNA sequence according to Claim 1 or 2. covalently associated with a delectable label substance. 

9. A DNA sequence according to Claim 8, wherein the detectable label is a radkjiabel. 

10. A single-strand DNA sequence according to Claim 8 or 9. 

11. ADNAsequenceacroidingtoClaiml. coding for (Phei5]hEP04Phe49]hEPOJ^^^^ fHis^lhEPO rw 
de8-Pro2 through IteejhEPO. [des-Thriea through Argi66)hEPO. orlA27-55]hEPO ' 

aaTsT? 3°6 %T^f" host cell transformed or transfected with a DNA sequence ac^oFding to any one of 
Claims 1 . 2, 3. 6. 7 and 8. in a manner allowing the host cell to express said polypeptkJe product 

13. Aj^ransformedortransfectedhostcellaccordingtoCbim 12w^^^ 

14. A transfomned or transfected mammalian host cell according to Claim 13. 

15. A transformed or transfected COS cell according to Claim 13. 

16. A transformed or transfected CHO cell according to Claim 1 3. 

18. A procan^otfc or eucaryotic host cell stably transfom»d or transfected with a DNA vector according to Claim 17. 

t^S^^"Z27S^^T^ 'rTT °* "'"^^ conformation of human or monkey erythro- 

^nnl JL K °' ^^"'^ ^ °' ^"^''"^ °f derivative thereof possessing the bioloqical 

TmlTnh "^Z"^ .^" '"^''^^ ^ retfculocytes and r^k.^ cells to incS 

enous DNA sequence and which has higher molecuiar weight by SDS-PAGE from erythropoietin isolated from 
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urinary sources. 

20, A glycoprotein polypeptide according to Claim 1 9 having an average carbohydrate composition which differs from 
that ol human erythropoietin isolated from urinary sources. 

21. A polypeptide according to Claim 19 or 20 wherein the exogenous sequence is a cDNA sequence. 

2Z A polypeptide according to Claim 19 or 20 wherein the exogenous DNA sequence is a genomic DNA sequence. 

"> 23. A polypeptide according to Claim 19 or 20 wherein the exogenous DNA sequence is carried on an autonomously 
replicating circular DNA plasmid or viral vector. 

24. A polypeptide according to any one of Cteims 1 9 to 23 further characterized by being covalently associated with 
a detectable label substance. 

IS 

25. A polypeptide according to Claim 24. wherein said detectable label is a radiolabel. 

26. A polypeptide product of the expression in a eucaryotic host cell of a DNA sequence according to any of Claims 
1, 2, 3, 5, 6 and 7. 

27. A pr-ocess for production of a poVpeptide having at least part of the primary structural conformation of erythropoietin 
to allow possession of the biological property of causing bone marrow cells to increase production of reticulocytes 
and red blood cells and to increase hemoglobin synthesis or Iron uptake, which process is characterized by cuNuring 
under suitable nutnent conditions a procaryotic or eucaryotic host cell transformed or transfected with a DNA 
sequence according to any of Claims 1 , 2. 3. 5. 6 and 7 in a manner allowing the host cell to express said polypep- 
tide; and optionally isolating the desired polypeptide product of the expression of the DNA sequence. 

28. A process according to Claim 27. characterized by culturing a host cell of any one of Claims 12 to 16. 

30 29. A process according to Claim 27 or 28 for production of a polypeptide of any one of Claims 1 9 to 23 and 26. 

30. A pharmaceutical composition comprising a polypeptide produced In accordance with the process of Claim 27 28 
or 29 and a pharmaceutically acceptable diluent, adjuvant or carrier. 

3S 31 . A phamnaceutioal composition according to Claim 30. comprising a polypeptide of any one of Claims 1 9 to 23 and 

26. 
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Patentanspruche 

1. DNA-Sequenz. die dafOr verwendet wird. Expression in einer prokaryontischen Oder eukaryontischen WIrtszelle 
eines Polypeptidprodukts zu erreichen. das zumindest teilweise die Primarstrukturkonfoimation von Erythropoietin 
aufweist. urn den Besitz der biologischen Eigenschaft zu ermoglichen. Knochenmarkszellen zu veranlassen die 
ProduktK>n von Retikulozyten und roten Blutkorperchenzu steigern und Hamogtobinsyntheseoder Eisenaulnahme 
zu steigern. wobei besagte DNA-Sequenz aus der Gruppe ausgewahit ist. die aus 

(a) den in den Tabellen V und VI dargelegten DNA-Sequenzen Oder deren Komplementarstrangen- 

(b) DNA-Sequenzen. die unter stringenten Bedingungen zu den proteincodierenden Bereichen der' in (a) de- 
finierten DNA-Sequenzen oder Fragmenten derselben hybridisieren; und 

(c) DNA-Sequenzen, die ohne die Entartung des genetlschen Codes zu den in (a) und (b) definierten DNA- 
Sequenzen hybridisieren wQrden; 

besteht. 

2. DNA-Sequenz nach Anspruch 1 , die Human-Erythropoietin codiert. 

3. cDNA-Sequenz gemaO Anspruch 1 . die ein Erythropoietin einer Affenart codiert. 
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4. DNA-Sequenz nach Anspruch 3. die den in Tabelte V dargestelften proteincodierenden Bereich einschlieBt. 

5. Genomische DNA-Sequenz nach Anspruch 1 Oder 2. 

6. DNA-Sequenz nach Anspruch 5, die ein Erythropoietin einer menschlichen Spezies codiert. 

7. DNA-Sequenz nach Anspruch 6, die den In Tabelle VI dargestelften proteincodierenden Bereich einschlieRt. 

8. DNA-Sequenz nach Anspruch 1 oder 2. die kovalent mit einer nachweisbaren Marklerungssubstanz verbunden ist. 

9. DNA-Sequenz nach Anspruch 8. dadurch gekennzeichnet. daB die nachweisbare IVIarkierung eine radbaktive 
Markierung ist. 

10. Einzelstrangige DNA-Sequenz nach Anspnjch 8 Oder 9. 

11. DNA-Sequenz nach Anspruch 1 , die fur [PheisjhEPO. [Phe49]hEPO, [Phei^sjhEPO. [His^lhEPO. f Asn^ des - Pro2 
bis lle6]hEPO. [des - Thriw bis ArgieejhEPO Oder [A27-55]hEPO codiert. 

12. Prokaryontische oder eukaryontische Wirtszelle. transformlert oder transfizlert mIt einer DNA-Sequenz nach einem 
der Anspruche 1. 2, 3, 6. 7 und 8, in einer Weise. die es der Wirtszelle eriaubt. besagtes Polypeptidprodukl zu 
exprimieren. r r- 

13. Transformierte oder transfizierte Wirtszelle nach Anspruch 12. dadurch gekennzeichnet, daB die Wirtszelle in der 
Lage ist, besagtes Polypeptid zu glykosylieren. 

14. Transformierte oder transfizierte Saugetier-Wirtszelle nach Anspruch 13. 

15. Transformierte oder transfizierte COS-Zelle nach Anspruch 13. 

16. Transformierte oder transfizierte CHO-Zelle nach Anspruch 13. 

17. BiologischfunktlonellerZlrkular-Plasmld-oderWus-DNA-Vektor.de^ 
Che 1 . 2, 3, 5. 6. 7 oder 11 einschlieBt. 

18. Prokaryontische oder eukaryontische Wirtszelle. die mit einem DNA-Vektor nach Anspruch 17 stabil transfomiert 
Oder transfiziert ist. 

19. Rekornblnantes Polypeptid. das einen Teil oder die Gesamtheit der Primarstrukturkonformation von Human- oder 
Affen-Erythropoietin. wie dargestellt In Tabelle VI oder Tabelle V. oder Irgendeine allelische V^riante oder ein De- 
riva^ desselben aufweist; das die biotoglsche Elgenschaft besitzt. Knochenmarkszellen zu veranlassen die Pro- 
duktion von Retikulozyten oder roten Blutkorperchen zu stelgem und Hamoglobinsynthese oder Elsenaufnahme 
zu steigem. dadurch gekennzeichnet. daB es das Produkt eukaryontlscher Expression einer exogenen DNA^e- 
queriz ist und welches gemaB SDS-PAGE ein hoheres Molekulargewicht hat als Erythropoietin, das aus Urin- 
Quellen isohert ist. 

20. Glykoproteln-Polypeptid nach Anspruch 19. das eine mittlere Kohlenhydratzusammensetzung aufweist die sich 
von derjenigen von Human-Erythropoietin unterscheidet. das aus UrinK^uellen isoliert ist. 

21. PolypeptkJ nach Anspruch 19 oder 20. dadurch gekennzeichnet. daB die exogene DNA-Sequenz eine cDNA- 
Sequenz ist. 

22. Polypeptid nach Anspruch 1 9 oder 20. dadurch gekennzeichnet. da6 die exogene DNA^equenz eine genomische 
UNA-Sequenz ist. 

23. Polypeptid nach Anspruch 19 oder 20. dadurch gekennzeichnet. daB die exogene DNA-Sequenz auf einem au- 
tonom replizierenden Zirkular-DNA-PlasmkJ- oder Virus-Vektor sitzt. 

24. Polypeptid nach einem der AnsprOche 1 9 bis 23. welter dadurch gekennzeichnet. daB es kovalent mit einer nach- 
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weisbaren Markierungssubstanz verbunden ist 

25. Polypeptid nach Anspruch 24, dadurch gekennzeichnet. daB besagte nachweisbare Markierung eine radbaktive 
Markierung ist. 

26. PolypeptWprodukt der Expresston einer eukaroyntischen Wirtszello einer DNA-Sequenz nach einom der AnsprO- 

che1,2. 3, 5,6und7. 

27. Vertahren zur Herstellung eines Polypeptids, das zumindest leilweise die Primarstmkturkonformatlon von Erythro- 
poietin aufweist. urn den Besitz der btologischen Eigenschaft zu ermoglichen, Knochenmarkszellen zu veranlas- 
sen. die Produktion von Retikulozyten und roten Blutkorperchen zu steigem und Hamoglobinsynthese Oder Ei- 
senautnahme zu steigem. dadurch gekennzeichnet. daB eine prokaryontische Oder eukaryontische Wirtszelle die 
mit einer DNA-Sequenz nach einem der AnsprOche 1 . 2, 3, 5. 6 und 7 derart transfoimiert oder transfiziert woiijen 
ist. daB es der Wirtszelle ermoglicht ist. besagtes Polypeptid zu exprimieren, unter geeigneten Nahrbedingungen 
kultmen wird; und daB fakultativdas gewOnschte Polypeptidprodukt der Expression der DNA-Sequenz isoliert wird. 

2a Vertahren nach Anspruch 27. dadurch gekennzeichnet. daB eine Wirtszelle nach einem der AnsprOche 12 bis 16 
kultivierl wird. 

29. Vertahren nach Anspruch 27 Oder 28 zur Produktksn eines PolypeptWs nach einem der AnsprOche 1 9 bis 23 und 26. 

30. Phanmazeutische Zusammensetzung. die ein Polypeptid. das gemaB dem Vertahren nach Anspruch 27 28 oder 
29 produziert ist. und ein pharmazeutisch annehmbares VerdOnnungsmittel. Adjuvans oder Tragermittel umfaBt. 

31. Pharmazeutische Zusammensetzung nach Anspruch 30. die ein PolypeptkJ nach einem der AnsprOche 1 9 bis 23 
und 26 umfaBt. 



Revendlcatlons 

1. Sequence d'ADN ulillsable pour garantir fexpresskxi. dans une cellule hete procaryote ou eucaryote, d'un produit 
poiypeptidique ayant au nwins une partle de la confomration stnicturelle primaire de celle de r6ryrthropoI6tine pour 
u. permettre de poss^der la propri6t6 btokjgkjue de provoquer. par les cellules de la moelfe osseuse Caugmen- 
tation de la production des rdtculocytes et des globules rouges, et faugmentation de la synthtee tfhfimoglobine 
ou du captage de fer. ladite s^uence d'ADN 6tant choisie dans le groupe constitud de : 

a) les s6quences d'ADN indiquSes dans les tableaux V et VI ou leurs brins compMmentaires • 

b) des sequences d'ADN qui s'hybrident dans des condittons strides aux r^ions de codage de protdine des 
sequences d'ADN ddfinies dans (a) ou des fragments de celles-ci; et 

2*^M ^^"^^ "^'^^^ 'I"'- ^ d6g6n6rescence du code"g6n6tk,ue. s'hybrkJeiaient aux sequences 
d'ADN d^finies dans (a) et (b). 

2. S6quence d'ADN selon la revendteatton 1 . codant l'6rythropoi(6tine humaine. 

3. S6quence d'ADNc selon la revendfcation 1, qui est une sequence d'ADN codant MrythropoWtine de singe. 

4. S^uence d'ADN seton la revendteatton 3 et incluant la r^ion de codage de protdine indiqu6e dans le tableau V 

5. Sequence d'ADN g6nomkiue selon la revendfcatten 1 ou 2. 

6. S^uence d'ADN codant rSiythropoMtine humaine selon la revendfcatten 5. 

7. sequence d'ADN selon la revendcation 6 et incluant la region de codage de prot6ine indiqu6e dans le tableau VI. 

8. Sequence d'ADN selon la revendication 1 ou 2, associte de facon covalente k une substance de marquage 06- 

iGClaDI8. 

9. sequence d'ADN seton la revendfcatten 8, dans laquelle le marqueur detectable est un radiomarqueur. 
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10. Sequence d'ADN simple brin seton la revendication B ou 9. 

11. Sequence d'ADN selon la revendication 1 codant pour (Phe^^jEpo, (Phe^^hEPO, [Phei^sj^EPo. [His^JhEPO, 
[Asn2 des-Pro2 ^ ||e6]hEPO, [des-Thri^a ^ ArgiesjhEPO. ou [A27-55]hEPO. 

12. Cellule hole procaryote ou eucaryote transfonm6e ou transfect6e par une sequence d'ADN selon Tune quelconque 
des revendicatlons 1 . 2, 3, 6, 7 et 8, d'une fa^on permettant ^ la cellule h6te d'exprimer (edit prodult polypeptidique. 

13. Cellule hote transform^e ou transfect6e selon la revendication 12, laquelle cellule h&te est capable de glycosyler 
(edit polypeptide. 

14. Cellule hote de mammif6re transform6e ou transfect6e selon la revendication 13. 

15. Cellule COS transtormde ou transfectde sebn la revendication 13. 

16. Cellule CHO transformde ou transfect6e selon la revendication 13. 

17. Vecteur d'ADN viral ou de plasmlde circulaire biologlquement fonctionnel, incluant une sequence d'ADN selon une 
quelconque des revendicatlons 1, 2, 3, 5, 6, 7 ou 11. 

18. Cellule hote procaryote ou eucaryote transformde ou translectde de fagon stable par un vecteur d'ADN selon ta 
revendication 17. 

19. Polypeptide recombinant ayant tout ou partie de la conformation structurelle primaire de I'erythropoTetine humaine 
ou de singe, comme indlqu6e dans le tableau VI ou le tableau V. ou tout variant alldlique ou d6riv6 de celui-ci 
poss6dant la propri6t6 biologique de provoquer, par les cellules de la moelle osseuse. I'augmentation de la pro- 
duction des reticulocytes et des globules rouges et I'augmentation de la synthase d'hdmogloblne ou du captage 
de fer, et caract6ris6 en ce qu'll est le produit de I'expression eucaryote d'une sequence d'ADN exogfene et qui 
pr^sente un poids mol6culalre plus 6lev6 par SDS-PAGE k partir de l'6rythropoi6tine Isolde de sources urinaires. 

20. Polypeptide de glycoproteine selon la revendication 1 9 ayant une compositbn moyenne de carbohydrate qui dlff6re 
de celle de I'drythropoT^tine humaine Isolde de sources urinaires. 

21. Polypeptide selon la revendication 19 ou 20, dans lequel !a sequence exogdne est une sequence d'ADNc. 

22. Polypeptide selon la revendication 19 ou 20. dans lequel la s6quence d'ADN exogfene est une sequence d'ADN 
gdnomlque. 

23. Polypeptide sebn la revendication 19 ou 20, dans lequel la sequence d'ADN exogSne est port6e sur un vecteur 
viral ou de plasmida d'ADN circulaire se rdpliquant de fagon autonome. 

24. Polypeptide selon I'une quelconque des revendicatlons 19 & 23, caract6ris6 de plus en dtant associd de fagon 
covalente ^ une substance de marquage detectable. 

25. Polypeptide selon la revendication 24, dans lequel ledit marqueur detectable est un radiomarqueur 

26. Produit polypeptidique de Pexpression, dans une cellule h6te eucaryote, d'une sequence d'ADN sebn une quel- 
conque des revendicatbns 1 , 2, 3. 5 ,6 et 7. 

27. Proc6d6 pour produire un polypeptide ayant au moins une partie de la conformation structurelle primaire de 
I'erythropoietine pour lui permettre de possdder la proprietd bblogique de provoquer, par les cellules de la moelle 
osseuse, I'augmentation de la production des reticulocytes et des gbbules rouges, et j'augmentatlon de la synthase 
d'hemoglobine ou du captage de fer, lequel procede est caracterise par la mise en culture, dans des conditbns 
nutritives appropriees. d'une cellule h6te procaryote ou eucaryote transtorm6e ou transfectee avec une sequence 
d|ADN sebn une quelconque des revendicatlons 1, 2, 3, 5, 6 et 7 d'une maniere permettant d la cellule hote 
d'exprimer ledit polypeptide et. 6ventuellement. I'isolement du produit polypeptidique souhaite de I'expression de 
la sequence d'ADN. 
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28. Proc6d6 seton la revendication 27, caract6ris6 par la mise en culture d'une cellule h6te selon I'une quelconque 
des revendications 12^16. 

29. Proc6d6 selon la revendication 27 ou 28 pour produire un polypeptide selon rune quelconque des revendications 
19d23 et26. 

30. Composition pharmaceutique comprenant un polypeptide produit conform6ment au proc6d6 de la revendlcatbn 
27, 28 ou 29 et un support, adjuvant ou diluant pharmaceutiquement acceptable. 

31. Composition phamiaceutique selon la revendication 30, comprenant un polypeptide selon une quelconque des 
revendications 1 9 § 23 et 26. 
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